

# THE HARDWARE OF THE KDF9

by BILL FINDLAY

## 1: KDF9

### 1.1: BACKGROUND AND OVERVIEW

Announced in 1960, the English Electric KDF9 [Davis60] was one of the most successful products of the early UK computer industry. Even in an era of architectural experimentation, the designers of the KDF9 were bold and innovative. The CPU used separate hardware stacks for expression evaluation and for subroutine linkage. Its three concurrently-running control units shared the work of fetching, decoding, and executing machine code instructions, synchronized by a variety of hardware interlocks.

The KDF9 was one of the earliest fully hardware-secured multiprogramming systems. Up to four programs could be run at once under the control of its elegantly simple operating system, the **Timesharing Director**. Each program was confined to its own core area by hardware relocation, and had its own sets of working registers, which were activated when the program was dispatched, so that context switching was efficient. A program could drive I/O devices directly, but was limited by hardware checks to those the Director had allocated to it.

This paper is one of a series. Companion papers provide a synoptic description of the KDF9's architecture and software [FindlayEE]; a detailed description of the KDF9's various software systems [FindlaySW]; and an account of KDF's rôle in the development of computer performance benchmarking [FindlayBM]. Here I focus on the hardware, especially on the instruction set and its implementation. It may be useful to read [FindlayEE] before the present work, which deepens that account, but it is not in the least necessary, as all essential material is repeated.

My primary sources are EE internal engineering reports [EEC62a, b, c], and the nearest thing we have to a KDF9 reference manual: the *KDF9 Programming Manual* [ICL69], called simply 'the Manual' in the following.

A comprehensive bibliography of extant KDF9 documents is provided in [FindlayKB], which is updated from time to time as new sources come to light. Much of this material is available online.

### 1.2: AUTONOMOUS UNITS AND THEIR STORES

A KDF9 computer has three primary and many secondary control units, all microprogrammed and running in parallel with each other, subject to appropriate interlocks. A useful way to think of its large-scale structure is as a community of cooperating finite-state machines. The whole complex works to the beat of a clock giving two pulse trains (P1 and P2) each of  $0.25\mu\text{s}/0.75\mu\text{s}$  mark/space ratio, and  $0.5\mu\text{s}$  out of phase.

The primary control units are **Main Control** (MC), **Arithmetic Control** (AC), and **I/O Control** (IOC). According to a personal communication from Michael Wetherfield—one of the designers of the KDF9 architecture—this division of responsibility was suggested, at least in part, by somewhat earlier plans for the ILLIAC II [UofI58].

MC takes most of the responsibility for instruction sequencing, including instruction fetching, jumps, subroutines, and interrupts; for mediating access to the KDF9's various stores; and for address generation and indexing. The **main (core) store** has up to 32768 words, each of 48 bits. The **Q Store** is a bank of 16 index registers, each of 48 bits. The **Subroutine Jump Nesting Store (SJNS)** is a stack of 16-bit return addresses, with maximum depth 17 **links**.

AC directly executes simple ALU instructions, and delegates to the Multiplier/Divider unit and the Shift unit, for more complex arithmetic operations. AC operands are taken from, and results returned to, the **nesting store** (or **nest**), which is an expression-evaluation stack with maximum depth 19 words. In addition there is a 1-bit **Overflow Register**, typically set when the range of a result is exceeded, and a 1-bit Boolean **Test Register**.

Note the absence of a conventional 'program counter' register. This is explained later.

IOC supervises up to 16 DMA channels (known as **buffers** in KDF9 terminology) that are capable of simultaneous, independent transfers. Each buffer operates under the control of its own autonomous microprogram. IOC is responsible for the **Lock-Out Store**—one bit to every 32 words of main store, for mutual exclusion of CPU and I/O transfers.

The Lock-Out Store, Q Store, SJNS and nest are implemented with the same technology: as core storage with a read cycle time of  $1\mu\text{s}$  and a write cycle time of  $1.5\mu\text{s}$ . The main store is comparatively slow, having a read or write cycle time of  $6\mu\text{s}$ , and the architecture of the KDF9 goes to some lengths to mitigate the effect this has on performance.

None of these stores have parity checking.

A hardware **timesharing** option replicates the nest, SJNS and Q Store, so that four register sets are provided, thereby obviating the need to save and restore 48 registers when context-switching between a maximum of four concurrently running processes or **problem programs**. KDF9 runs under the supervision of a Director program, a set of privileged routines normally resident at the bottom of store. Its main function is to respond to interrupts, and thereby provide essential services to problem programs, including management of the timesharing facility. (In the UK computer parlance of the early 1960s, a 'timesharing' system implements multiprogramming, not multi-user interactivity.)

A more unusual type of storage holds the microprogram in the **sequence units**, which direct the actions of MC, AC, IOC, and the buffers. The bits in these stores are fabricated by **pulse transformers**. These are large ferrite toroids, about 1cm in diameter, with multiple secondary windings. When a transformer's primary winding is pulsed, its secondary windings emit a variety of control signals.

The fastest storage elements, registers accessed in critical paths, are made from transistor flip-flops. Transistors are also used as amplifiers for the outputs from core stores and pulse transformers. A KDF9 includes something on the order of 20,000 transistors—a remarkably low figure for such a sophisticated machine, and testimony to its philosophy of gaining speed through clever design instead of heroic expenditure on hardware. Perhaps we glimpse the ghost of Turing, whose computer designs were predicated on exactly the same attitude.

### 1.3: DATA FORMATS

A KDF9 data word consists of 48 bits numbered from D0 (the most significant bit) to D47 (the least significant). Similar big endian numbering is applied to part-word bit fields. Several data formats are supported:

- as a 48-bit, 2's complement binary integer, fraction, or fixed-point number
- as a 48-bit floating point number, consisting of a sign in D0, an excess-128 exponent in D1-D8, and a mantissa in D9 to D47; where the mantissa, together with D0, forms a 2's complement fraction of 40 bits
- as half of a 96-bit, 2's complement binary integer, fraction, or fixed-point number, where D48 (D0 of the second word) is set to zero, so that the effective capacity is 95 bits
- as half of a 96-bit floating point number, consisting of sign in D0, excess-128 exponent in D1-D8, a first part of the mantissa in D9 to D47, D48 set to zero, and a second part of the mantissa in D57-D95; where D49-D56 are set equal to (exponent – 39) unless that would be negative, in which case the whole of D48-D95 is set to 0; and where the mantissa, together with D0, forms a 2's complement fraction of 79 bits
- as two 24-bit **halfwords**
- as eight characters of 6 bits each
- as three 16-bit 2's complement integers that constitute the **Counter (C-part, D0-D15)**, the **Increment (I-part, D16-D31)** and the **Modifier (M-part, D32-D47)** of an indexing word in **Q Store format**
- as a control word for I/O instructions, in which the C-part is a device number; the I-part a main store start-address, or unused; and the M-part a main store end-address, or an operation count, or unused

Single-precision floating-point results are usually rounded: 1 is added to D47 of the result if the first truncated bit is not 0; but double-precision results are always unrounded: the truncated bits are ignored. Floating-point results are always **standardised** (normalised, so that D0 ≠ D9); the only floating point operation that gives a well defined result from a non-standardised operand is the STAND instruction, which effects normalisation.

There are no double-word fetch or store instructions: double words need a pair of fetch or store instructions to copy them into or out of the nest. The less significant word of a pair is held in the deeper of its two nest cells.

There are instructions for fetching and storing 24-bit halfwords. Fetching a halfword expands it to a full word in the nest by filling in the 24 least significant bits with zeros. Storing a halfword from a word in the nest extracts its most significant 24 bits. A halfword can hold a 24-bit 2's complement binary integer, fraction, or fixed-point number, or a floating-point number (the latter consisting of the most significant 24 bits of the 48-bit floating-point format).

I/O devices employ 6-bit characters, up to eight of which can be held in one word; however there are no facilities to address packed characters in main store. Characters are packed into words beginning at the most significant six bits. Conveniently, in all the KDF9 character codes, six zero bits represent a blank (space) character; and six one bits represent a **filler** character, which legible output devices such as the line printer and Flexowriter completely suppress. For a full listing of the character codes used by the various I/O devices, see Appendix 4.

### 1.4: INSTRUCTION FORMATS

A KDF9 instruction consists of one, two or three **syllables** of eight bits, the first two bits of each instruction giving its type, which is 00<sub>2</sub> for one-syllable instructions, 01<sub>2</sub> for two-syllable instructions, 10<sub>2</sub> for three-syllable jumps, and 11<sub>2</sub> for 'directly addressed' fetch and store instructions of three syllables. Each instruction word is therefore capable of holding between two and six instructions, dependent on their lengths.

One-syllable instructions do not contain an operand address or other parameter, and operate only on the nest in 'Reverse Polish' style. They are carried out by AC (Arithmetic Control). Instructions of two or three syllables are primarily the responsibility of MC (Main Control). Two-syllable operations include all operations that require one or more Q Store numbers—'indirect' memory fetches and stores, operations on the contents of Q Stores, shift orders, I/O orders, and the special JrCpNZS jump instruction. Three-syllable jumps contain a 16-bit instruction address. Directly addressed fetch and store orders contain a 15-bit word address, and the SET instruction contains a 16-bit constant.

The KDF9 assembly language, called **Usercode** [EEC69], is very unusual in having a 'distributed' syntax that embeds parameters within the Usercode order. For example, the J10C2NZ instruction means 'Jump to label **10** if the **C**-part of Q Store **2** is Not Zero'. It is possible that this format was suggested by the order code, which distributes opcode and address bits around the machine instruction in an equally unconventional manner—see Appendix 2.

Usercode instructions are labelled by integers. An asterisk preceding a label forces the following, labelled, instruction to start at syllable 0 of a fresh word, any unused syllables of the preceding word being padded with DUMMY (no-op) instructions. This is necessary for the target label of a JrCpNZS instruction, which does not contain an address, but jumps to syllable 0 of the word before that containing the JrCpNZS itself.

## 2: THE CPU

### 2.1: MAIN CONTROL, ARITHMETIC CONTROL AND I/O CONTROL

KDF9 comprises two blocks: the I/O block and the Computing block. The I/O block is provided with instructions by the Computing block, but otherwise works independently, taking priority for access to main store. The Computing block is composed of MC and AC. MC works collaboratively with AC. They interact most strongly on conditional jumps, where AC computes the Boolean; on shifts, where MC calculates the shift length and AC performs the shift; and on fetches/stores, where MC transfers data to/from the store and AC transfers it to/from the nest. The latter, in particular, does much to compensate for the relatively slow core store.

For a fetch operation, MC initiates a store read. A word is transferred from the store to whichever of two fetch buffers is empty. If both contain data that has not yet been delivered to AC, the transfer is delayed until AC catches up. The fetch buffers mean that MC can complete up to two fetch instructions, their core cycles being overlapped with computation by AC. This is particularly valuable in the very frequently executed inner loops that evaluate scalar products. (KDF9 was very much a 'scientific' computer.) For a store operation, MC saves the destination address in a register; when AC catches up it copies the value from the nest into a store buffer register. When convenient, MC copies the contents of the buffer register to store. Instruction fetching is also the responsibility of MC. There are two instruction-word buffers. MC fetches an instruction word to the instruction-word buffer that is not currently being inspected by AC. Thus instruction-fetching can also overlap with computation by AC.

The art of ‘optimum programming’ on KDF9 is concerned with the careful relative placement of fetch, store, jump and arithmetic operations, with a view to maximizing the parallelism between store cycles and calculation. In this respect KDF9 is very like modern CPUs.

## 2.2: ADDRESSING AND THE Q STORE

Problem programs address their instructions and data starting from a virtual effective address of 0. When store is accessed, the hardware offsets this address by the program’s starting position in main store. At the same time it checks that the virtual address does not exceed the number of locations allocated to the program by Director. In Director state no offsetting is done, so that Director starts at physical location 0.

The address part of a jump instruction consists of a 13-bit word number and the 3-bit number of a syllable within that word. Consequently, the instructions of a program are confined to its first 8192 words. EXIT (return from subroutine), EXITD (return from Director to a problem program) and OUT (invoke Director from a problem program) all have the same basic format as a jump. EXIT’s address part is treated as an offset, to be added to the link in the SJNS.

Effective addresses for data fetches and stores are generated in either of two ways: by 3-syllable instructions containing a 15-bit constant address part and a single Q Store reference,  $Qq$ ; or by 2-syllable instructions containing two Q Store references,  $Qk$  and  $Qq$ . The effective address in the first case is the sum of the constant and the contents of  $Mq$ ; in the second case it is the sum of  $Mk$  and  $Mq$ .  $Q0$  always contains 0, providing a handy way of getting a zero base.

Data locations in EE Usercode have rigidly stereotyped identifiers. Local variables of subroutines have names of the form  $Vm$ ; global variables have names of the form  $Wm$ ,  $Ym$ ,  $YAm$ , ...,  $YZm$ . If indexed, the identifier is followed by ‘M’ and the Q Store number; the 2-syllable instructions take the form ‘ $MkMq$ ’. Usercode instructions such as  $M1M2$ ;  $V9$ ;  $YA7M2$ ; and so on, represent data fetches that push values onto the nest, while  $=M1M2$ ;  $=V9$ ;  $=YA7M2$ ; and so on, represent data stores that pop values from the nest.

Flags suffixed to the instruction optionally specify index updating, halfword addressing, and ‘next word’ addressing.

The ‘Q’ suffix causes the  $Qq$  register to be updated, *after* the effective address is determined, by adding the contents of  $Iq$  to  $Mq$  and decrementing the contents of  $Cq$ . This allows stepping through a predetermined (but variable) number of locations, at addresses given by an initial address and a predetermined (but variable) stride. There are conditional jump instructions that test whether the C-part of a Q register is (non-)zero, providing for very efficient counting loops.

The ‘H’ suffix, on a 2-syllable fetch or store order, causes the operand accessed to be a halfword. In this case the content of  $Mk$  is taken as a base word address, and the content of  $Mq$  is taken as a halfword offset, odd-numbered halfwords being those in D24-D47 of the addressed word.

The ‘N’ suffix, on a 2-syllable fetch or store order, causes the accessed word to be at an address 1 greater than usual (i.e., the next word). This allows efficient processing of arrays of pairs of elements stored sequentially in adjacent words —such as the constituent words of a double-precision number—since  $Qq$  needs to be updated only once for every word pair, using an increment part set to 2. All combinations of Q, H, and N, in that order, are permitted in a 2-syllable fetch or store order.

The 3-syllable fetch and store orders take  $6\mu s$ , the 2-syllable fetch and store orders take  $7\mu s$ , and an additional  $1\mu s$  is needed for Q register updating in both cases. It is merely coincidental that the time taken by 3-syllable fetch and store orders is the same as the main store read/write cycle time—the core store cycle does not begin until  $1.5\mu s$ - $3.5\mu s$  after the start of the instruction and the  $6\mu s$  it takes is *not* included in the total, as it overlaps with following instructions.

## 2.3: EXPRESSION EVALUATION AND THE NEST

An adder associated with the Q Store carries out the arithmetic involved in Q Store updating and in computing effective addresses. All other calculations, both fixed-point and floating-point, involve the nest. It is common for parameters of a subroutine to be supplied in the nest, especially when the routine implements an arithmetic function.

The top three cells of the nest (N1 at the top, then N2 and N3 as we go deeper) are held in three flip-flop registers. The N-registers are managed, transparently to problem programs, by a combination of hardware logic and Director software. When a value is pushed onto the nest, hardware saves N3 in the nest’s core stack, N2 in N3, N1 in N2, and the new value in the vacated N1. Popping a value is the simple converse.

At most 16 nest cells are available to a program. The **Nest Over-/Under-flow Violation** (NOUV) interrupt happens after pushing a 17th value, or after popping an empty nest. Since an interrupt is not effected until it is convenient for MC, AC may have performed several further nest operations, leaving the contents of the nest unpredictable. Consequently, no recovery from NOUV is possible for a problem program.

Keeping track of nest depth is one of a KDF9 programmer’s main chores, and NOUV interrupts are among the most common results of programming error. In a modular program with deeply nested subroutines it can be difficult to ensure that NOUV is always avoided. Routines can save (on entry) and restore (before exit), all or part of the nest contents; they are helped in this by conditional jumps that test whether the nest is empty ( $JrEN$ ) or not ( $JrNEN$ ).

It often happens that the order of operands in the nest, while convenient for some purposes, is inconvenient for others. To reorganize the nest, we have the 1-syllable instructions:

- REV:  $a, b, \dots \rightarrow b, a, \dots$
- DUP:  $a, \dots \rightarrow a, a, \dots$
- ERASE:  $a, \dots \rightarrow \dots$
- CAB:  $a, b, c, \dots \rightarrow c, a, b, \dots$
- PERM:  $a, b, c, \dots \rightarrow b, c, a, \dots$
- REVD:  $a, b, c, d, \dots \rightarrow c, d, a, b, \dots$
- DUPD:  $a, b, \dots \rightarrow a, b, a, b, \dots$

Simple instructions—including integer  $+$ ,  $-$ , AND, OR, REV, DUP, ERASE, *etc*—take from  $1\mu s$  to  $4\mu s$ ,  $2\mu s$  being typical. Floating-point addition takes from  $7\mu s$ , multiplication takes from  $14\mu s$ , and division takes from  $36\mu s$ ; the variation being due to data-dependent operand alignment and result normalization. Overflow is set on division by zero and when a numerical result is outside the range of the result type. It remains set until explicitly cleared.

The 1-syllable ZERO and the 3-syllable SET allow for the sourcing of small constants. The most efficient way to push -1 is: ZERO; NOT; and the most efficient way to get +1 is: ZERO; NOT; NEG;—the same number of syllables as SET 1; but slightly faster. Subtracting 1 is best done by: NEG; NOT; and adding 1 by: NOT; NEG. It is nice that these ‘hacks’ set the overflow in exactly the same cases as SET 1;  $+$ ; or SET 1;  $-$ . MAX arranges that  $N1 \geq N2$ , setting

overflow if they had to be exchanged. SIGN compares N1 with N2 and yields  $-1, 0$ , or  $+1$ , according as the sign of  $(N2-N1)$ , while avoiding the possibility of an overflow. BITS replaces N1 with a count of the number of 1-bits it contained. TOB yields a binary integer from a mixed-radix integer and a radix pattern; and FRB does the converse. For a full list of instructions, with execution times, see Appendix 1.

#### 2.4: CONTROL FLOW AND THE SJNS

The  $Jr$  instruction is an unconditional jump to the instruction at label  $r$ . The instructions:  $Jr=Z$ ,  $Jr\neq Z$ ,  $Jr>Z$ ,  $Jr\geq Z$ ,  $Jr\leq Z$  test the sign of the top cell of the nest; to compare the top two cells of the nest we have:  $Jr=$  and  $Jr\neq$ . All of these pop N1 whether they jump or not;  $Jr=$  and  $Jr\neq$  do not pop N2, being the only dyadic operations with this behaviour. To test whether  $Cq$  is (non-)zero we have:  $JrCqZ$ ,  $JrCqNZ$ , and  $JrCqNZS$ . The  $JrV$  and  $JrNV$  instructions are conditional on the Overflow Register being (un-)set, while  $JrTR$  and  $JrNTR$  are conditional on the Test Register being (un-)set.  $Jr(N)V$  and  $Jr(N)TR$  clear the designated register, whether they jump or not.

$JrCqNZS$  is known as the **short loop jump**. It jumps, if  $Cq$  is nonzero, to syllable 0 of the instruction word that precedes the word containing the  $JrCqNZS$  instruction; and has the further effect of inhibiting instruction fetch cycles. Thus the loop executes entirely from the instruction word buffers, with no overhead for instruction fetches. Important algorithms, such as scalar product and polynomial evaluation, fit comfortably into the 12 available syllables.

The  $JSr$  instruction branches unconditionally to the instruction at label  $r$ , and pushes its own address onto the SJNS as a return **link**. This creates issues of SJNS management similar to those that arise with the nest, and the instructions  $JrEJ$  and  $JrNEJ$  are provided analogously to  $JrEN$  and  $JrNEN$ . The only other instructions that access the SJNS are:

- **EXIT  $a$** : complementary to  $JSr$ —pops the link from the SJNS, adds the constant  $a$  (which is an integral multiple of three syllables), and branches to the resultant address
- **OUT**: pushes its own address onto the SJNS and causes an OUT interrupt, thereby switching control to Director
- **EXITD**: complementary to OUT, but also used when context-switching—pops the link from the SJNS and branches to the resultant address in program state, with interrupts enabled on completion
- **LINK**: pops a link from the SJNS and pushes it onto the nest
- **=LINK**: pops a link from the nest and pushes it onto the SJNS

Normal return from a subroutine is effected by EXIT 1; if there is an abnormal return path, it is taken by EXIT 1; and EXIT 2; is the normal return. Switches are programmed by putting the index into the SJNS, by means of the  $=LINK$  instruction, then the  $EXIT ARr$  instruction jumps to the selected word in the jump table starting at label  $r$ .

All interrupts, like OUT, use the SJNS for their return address.

### 3: INPUT/OUTPUT

#### 3.1: PERIPHERAL DEVICES

Up to 16 I/O buffers can be fitted. Each has its own independent control unit, which is microprogrammed to support its connected device. The monitor console Flexowriter is always on buffer 0, and a paper-tape reader with hardware bootstrap facility is always on buffer 1. Communication with the operators is by means of messages typed on the Flexowriter, which has a button that is pressed to gain the attention of Director by causing a typewriter interrupt. (Merely using the keyboard does not cause an interrupt.)

Peripheral devices fall into two classes: **slow**, or **character** devices; and **fast**, or **word** devices. Buffers for slow devices do a core cycle for each character transferred; fast device buffers do a core cycle for each complete word.

The slow devices include the Flexowriter (10ch/s), paper tape punches (110ch/s), paper tape readers (1000ch/s), card readers (10 card/s), card punches (5 card/s), graph plotters (200 step/s), and line printers (15 line/s).

The native KDF9 paper tape code is rather odd, and makes poor use of the eight punchable locations, or ‘channels’, in a tape frame. Channels 1-4 and 6-7 encode the six-bit character and channel 5 establishes even parity for the whole frame. But channel 8 is used only to distinguish completely blank tape from the space (SP) character, which has the all-zero code. (Since SP has even parity it would otherwise be represented by a frame with no punching.) Blank tape, and erased tape with all channels punched, are treated as content-less leader or trailer. In normal input modes they are skipped over by the buffer and not transferred. These peculiarities are inherited from the offline Flexowriter tape-punch stations used for data preparation and for the transcription of output tapes to typed copy.

The paper tape code has ‘Case Shift’ and ‘Case Normal’ characters to expand the character set. Flexowriters treat them literally—on receiving a Case Shift character, they shift up the type bars so that an alternative glyph can be printed for each code. For example, the data ‘Bill Findlay’ would be punched on tape as ‘**B**BILL **ñ**FBINDLAY’, where I have used **B** to denote the non-printing Case Shift and **ñ** the non-printing Case Normal. Transcribing a tape containing the characters ‘**B**BILL **ñ**FBINDLAY’ types out the text ‘Bill Findlay’.

Calcomp drum plotters (models 563, 564, 565 and 566) are an option. They are attached to tape punch buffers, which are manually switchable between punch and plotter. An I/O instruction is provided to test the position of the switch. The plotter takes 6-bit codes that move the pen and/or the paper by one step of fixed size at a time. Depending on the particular Calcomp model, steps are of size 0.01 or 0.005 inches, and are taken at the rate of 200 steps/s. Commands to raise or lower the pen take 0.1s. The paper is a roll up to 120 feet in length, and either 11 inches or 29.5 inches wide, depending on the model. Strangely, according to the Manual, a step in only one of the diagonal directions is available as a single code; a step on the other diagonal requires separate pen and paper steps. This is almost certainly a typo; external evidence provides convincing reasons for, and the codes of, the other diagonals.

Card readers and punches have two modes of operation. In the ‘alphanumeric’ mode each column corresponds to one character and is encoded or decoded accordingly. In the ‘direct’ mode the 12 rows of each column correspond to the 12 bits of two characters and are transferred literally. Line printers use a subset of the alphanumeric card code.

The fast devices are magnetic tape decks, fixed-disc drives, and drum stores.

The magnetic tape physical format uses 16 tracks across the tape: six data tracks, a parity track and a clock track are duplicated for each character. A character is valid if either copy can be read without error. Tapes run at 100inch/s, with 400ch/inch, for a transfer rate of 40kch/s. Inter-block gaps are about 8-9mm long. A faster, but seldom seen, magnetic tape deck runs at double speed (80Kch/s), recording on a steel band instead of plastic tape.

The fixed-disc drive has 17 disc platters, measuring about 31 inches in diameter. Disc blocks hold 40 words (320 characters); there are 16 blocks per track in an outer recording zone (further from the spindle) and eight per track in an inner zone. The discs spin at 1000 rev/min: the transfer rate in the outer zone is about 85Kch/s, but only half that in the

inner zone. Fixed heads are mounted on the 17th platter, providing access to 96 of the fastest blocks without a seek. Each of the other 16 platters has its own independent actuator, which carries eight read/write heads. Both surfaces of a platter have two heads in the inner track zone and two in the outer zone, so that 96 blocks are available without a seek. Seek times range from 156ms to 367.5ms, with an average of 231ms. The heads can be moved to 64 different positions, giving a capacity of 6144 blocks per platter, or 98304 blocks (31,457,280 characters) per drive. Up to four such drives can be attached to a disc buffer.

A drum consists of 320 addressable sectors, each of 128 words, for a total of only 40960 words of storage, accessed at a transfer rate of 500kch/s. Up to four units may be fitted to a drum buffer. Drums are of use mainly for storing frequently executed programs, such as the program source editors and compilers. KDF9 configurations do not ordinarily include drums, which offer a mutually exclusive alternative to fixed-disc drives.

### 3.2: I/O INSTRUCTIONS

There are two significant aspects to the software control of I/O devices on KDF9: their allocation to problem programs by Director, and their control by problem programs once allocated.

Programs can directly drive I/O devices on buffers allocated to them by Director. Any attempt to access an unallocated buffer causes a **Lock-In Violation** (LIV) interrupt, and termination of the program. This feature of KDF9 means that programs are inherently device-dependent: they must contain logic specific to the type of device they use. This is less of a disadvantage than it might seem, because card reader/punches, paper tape reader/punches, and printers, all use somewhat different character sets; so programs have to be device-aware for that reason, if no other.

Though program logic is coupled to device type, it is decoupled from device identity. To obtain access to a device, a program asks Director to allocate it one of the type. If such a device is available, Director allocates it to the program and returns its buffer number in N1. The program must save this number for future use.

The instructions that control devices take their parameters from a designated Q Store,  $Q_q$ .  $C_q$  contains the buffer number of the device. (For the fixed disc system,  $C_q$  also contains the seek area number, the platter number and the drive number, taking up all 16 bits.) Depending on the specific instruction,  $I_q$  and  $M_q$  might contain starting and ending *virtual* addresses for a data transfer operation, the transfer count being determined by their difference; or  $M_q$  might contain a repetition count for a control operation; or  $I_q$  and  $M_q$  might both be ignored.

If the least significant bit of an I/O instruction is 1, it sets the device off-line before initiating the operation. The device is only set on-line by the operator pressing a button on its control panel. In this way a program can set up a transfer on a tape reader, for example, which waits until the operator has loaded a tape before attempting to read.

After checking its validity, MC delegates an I/O instruction to its specified buffer, via IOC; if it is a data transfer instruction, MC also provides physical addresses converted from the virtual addresses given in  $I_q$  and  $M_q$ .

There is some attempt at making the effect of a given instruction generic, so that  $PIAQ_q$ , say, acts in a similar way on all input-capable devices, but this is not carried through with complete consistency. Some simpler devices respond to different orders in the same way:  $PIAQ_q$  and  $PIEQ_q$ , for example, have the same effect on a paper tape reader.

A completely generic feature of the I/O architecture is the provision of variable length transfer instructions. These take a (maximum) transfer count, as usual, but terminate early upon transferring an **End Message** character (written ' $\rightarrow$ ', code  $75_8$ ). When a variable-length read terminates on End Message, any remaining character positions of the last word transferred are set to zero (a SP character). In the case of fast devices, when a variable-length write terminates on End Message, any remaining character positions of the last *block* transferred are set to zero on the output device.

Another feature, generic to the slow devices only, is the availability of the misnamed **character** transfer option. These instructions read one character into, or write one character out of, the least significant bits of each word in the transferred area. This is the only I/O mode that permits the transfer of an arbitrary number of characters without the side effect of termination on End Message. When applied to paper tape, character mode further allows for all eight bits of a frame to be read or written, so that foreign codes can be handled (at the cost of forgoing hardware parity checks).

A peculiarity of the online Flexowriter is that, when a semicolon is written, the write operation changes itself on-the-fly to a read operation; subsequent typed input is transferred to the originally designated output buffer, in character positions following the semicolon. This can be used to program an atomic, prompt-and-response, form of interaction.

The  $PM_xQ_q$  instructions provide control operations, such as rewinding a magnetic tape or initiating a disc arm seek.

Several instructions enable device-dependent state (e.g., whether a magnetic tape is positioned at the Beginning Of Tape window) to be transferred to the CPU's Test Register; these instructions are also encoded in the  $PM_xQ_q$  group.

$PARQ_q$  and  $BUSYQ_q$  both transfer state from a buffer to the Test Register.  $BUSYQ_q$  checks whether a buffer is presently engaged in a transfer.  $PARQ_q$  checks and clears a parity error flag on a buffer's last completed transfer—any other command to a buffer with an uncleared parity error causes a LIV interrupt.

The Usercode programmer is given multiple, device-specific mnemonics for each hardware I/O instruction. For example, the output instruction  $POAQ_q$  can also be written as  $MWQ_q$  when the intended output device is a magnetic tape drive, and as  $TWQ_q$  when it is the console typewriter. The use of a specific mnemonic has no significance at run time; effectively it acts like a comment appended to an unspecific mnemonic in the source program.

Only a fraction of many possible I/O opcodes have defined effects. Moreover, some are defined only for one class of device (input or output). Attempting an undefined I/O operation causes a LIV interrupt. See Appendix 3.

### 3.3: I/O-DRIVEN MULTIPROGRAMMING

Program blocking by I/O operations is automatically managed by IOC, using a 12-bit **Program Hold-Up** (PHU) register for each of the four program priority levels.

While an I/O transfer is in progress, its core store area is **locked out** by the buffer, so that any attempt to access it concurrently, whether to fetch or store data, to fetch instructions, or use it for another I/O transfer, causes a **Lock-Out Violation** (LOV) interrupt. Attempting an I/O operation on an already-busy buffer also causes a LOV. In response to a LOV, Director suspends the responsible program. One bit of the Lock-Out Store is associated with each **group**, or block of 32 words, so for greatest overlap it is important to ensure that I/O areas are aligned on 32-word boundaries.

The PHU registers are constituted as follows:

- $PHU_n:D11$  is set if program level  $n$  is held up.
- If  $PHU_n:D10$  is 1 then  $PHU_n:D6-D9$  contain the number of the buffer on which program level  $n$  is waiting.
- If  $PHU_n:D10$  is 0 then there is a core lock-out in effect and  $PHU_n:D0-D9$  contain the number of the locked-out group.

A buffer records the CPU's privilege state at the start of a transfer. On completion, if Director started the transfer, IOC requests an **End of Director Transfer** (EDT) interrupt; but if the transfer belonged to a problem program, its PHU is cleared. If the cleared PHU belongs to a program of higher priority than the one currently running, IOC requests a **Program** (PR) interrupt. Since a suspended program may be waiting on a shared device (typically, the Flexowriter) made busy by a program of lower priority, every other PHU is then examined to see if it refers to the same buffer. If so, an EDT interrupt is requested instead of PR. This enables Director to take suitable action to resolve the priority inversion over the shared device.

Applying an INTQ<sub>q</sub> instruction to a busy buffer also causes a PR interrupt, effectively yielding the CPU to a program of lower priority; applying INTQ<sub>q</sub> to an idle buffer has no effect. PHU<sub>n</sub>:D0 is set to 1 if an INTQ<sub>q</sub> instruction is responsible for program level *n* being held up. All INTQ<sub>q</sub> holdups are cleared when *any* transfer terminates.

### 3.4: I/O CONTROL

IOC provides the interface between the CPU and the I/O devices on their buffers. It contains three sequence units. The ESU1 and ESU2 units provide a conduit for the transfer of data between main store and the buffers. The RSU initializes and finalizes I/O transfers, manages the Lock-Out and PHU Stores, and passes the parameters of the I/O instruction to the relevant buffer. At the end of the transfer the RSU requests the appropriate interrupt from MC.

The **C Store** provides one 48-bit word for each buffer. It contains the transfer's initial, final and current word addresses, the 2-bit priority level at which the transfer was requested, and a flag for Director transfers. The initial address is retained to allow a fast-device operation to be restarted automatically should a parity error be detected during the transfer. Slow device buffers use those 15 bits for different purposes: they hold the next character position within the current word, and indicate transfer options such as 'character' mode.

The **E Store** is a  $16 \times 4$ -bit FIFO that the buffers use to request service from IOC. When it needs attention, a buffer inserts its device number at the end of the E Store. The ESUs take the buffer number at the head of the FIFO, using it to select a C Store and so discover what needs to be done. The IOC can be seen as a multiplexor channel, in more modern language, capable of both byte and word transfers, but not having a 'burst' mode, locked on to a single device. The total I/O rate of the maximum complement of EE devices (8 of the faster tape decks, a disc drive, and a drum store—slow devices are irrelevant) does not exceed the bandwidth of the core store, so burst mode is not needed. However, one installation [WR67] did modify the E Store mechanism to allocate store cycles on a priority basis, with higher priority for faster devices, their motivation being the need to connect a very fast non-standard device.

## 4: DIRECTOR STATE

### 4.1: INTERRUPTS

KDF9 interrupts may be either **voluntary** (OUT or INTQ<sub>q</sub> obeyed by the problem program); **inadvertent** (illegal instruction, nest over-/under-flow, etc); or **housekeeping** (monitor typewriter, clock and I/O device interrupts).

The special register RFIR (Reason For Interrupt Register) records the currently requested interrupt(s). RFIR is inspected from time to time by MC. When an interrupt is accepted, control branches to word 0 of Director, leaving an instruction address in the top cell of the SJNS. On entry to Director state, NIFF (Non-Interrupt Flip-Flop) is set to inhibit further interrupts, but not their recording. However, the RESET interrupt is never inhibited and the NOUV interrupt is completely suppressed. NIFF is also, in effect, the Director-state flag, so that an interruptible Director mode of execution does not exist.

When fetched by the privileged K4 instruction, RFIR is automatically cleared. K4 delivers the following to N1:

- D0-D15: CLOCK COUNT; the integer in D0-D15 is incremented every  $32\mu\text{s}$ ; a CLOCK interrupt occurs when overflow from D1 sets D0, but if D0 is already set a RESET interrupt occurs instead ('double clocking')
- D22: the PR interrupt is caused by the end of a peripheral transfer started by a program of higher priority than the program currently running; the PR interrupt is also caused by an INTQ<sub>q</sub> instruction applied to a busy device
- D23: the FLEX (Flexowriter) interrupt is caused by the interrupt key on the console typewriter
- D24: the LIV (Lock-In Violation) interrupt is caused by an illegal or privileged instruction, by the use of an unallocated peripheral, by store address wraparound, or by a negative effective address
- D25: the NOUV (Nest Over- or Under-flow Violation) interrupt occurs when an attempt is made to overfill an already-full nest or SJNS, or to pop an already-empty nest or SJNS
- D26: the EDT (End of Director Transfer) interrupt occurs at the end of a peripheral transfer initiated by Director, or if a priority inversion has been detected among programs currently locked-out and waiting for a shared device
- D27: the OUT (system-call) interrupt is caused by the OUT system-call instruction
- D28: the LOV (Lock-Out Violation) interrupt is caused when a program attempts to access any of a locked-out group of 32 words, or when it addresses a busy peripheral device other than by BUSYQ<sub>q</sub> or INTQ<sub>q</sub>
- D29: the RESET interrupt is caused by jumping to an invalid syllable address (6 or 7), or by a watchdog 'double clocking' (implying that a previous CLOCK interrupt has not been handled by Director in the intervening time)

If a store-access instruction with Q Store updating experiences a Lock-Out Violation, the Q register update is suppressed. When the lock-out clears, and the interrupted program is resumed, the instruction can be restarted without doubly updating the Q register.

### 4.2: NESTING-STORE MANAGEMENT BY HARDWARE AND DIRECTOR

The nest depth is represented by a 5-bit counter register. Its least significant four bits provide the address of the cell to be accessed in the nest's 16-word core stack. NOUV is signalled in program state when a push completes with the depth equal to 16+1, and when a pop completes with the depth equal to 0-1. Since NOUV is completely suppressed when Director is running, it can use all 19 nest cells—the 16 in the core stack and the three flip-flop registers, N1-N3.

To maximize hardware speed in a critical path, the destructive reading of a cell in the nest core stack is *not* followed by a write cycle to restore the contents; so reading a cell also clears it. As the nest is a strictly LIFO store, this is not a problem: no attempt can be made to read the cleared value a second time.

Similarly, and again for speed, writing to a cell in the nest core stack is *not* preceded by a read cycle to clear it; so the written bit pattern is effectively OR-ed into the cell. This is valid only if the cell contained zero before the write cycle. Therefore, before entering a program for the first time, Director completely empties its nest—not merely setting the depth to zero, but explicitly clearing the contents. The ERASE order pops the nest, forcing a destructive read from the core stack into N3. By performing enough ERASE orders, Director can ensure that all 19 nest cells are completely

zeroized. Subsequent pushes, to a depth less than four, leave the state of the core stack unchanged, although the nest depth increases. The first push that increases the depth to four may cause a non-zero value to be saved in the core stack (at cell 3). This leaves cells 0-2 clear, even when the nest depth reaches 16, the most allowed to a problem program.

Director's short-path interrupt handler saves N1-N3 at zero cost, as a side-effect of pushing three of its own values onto the nest. There is always room for N1-N3 in the core stack—at worst, in cells 0-2. Before returning to an interrupted program, Director performs three ERASE orders, thereby restoring the values that N1-N3 contained before the interrupt. If the interrupted program's nest depth was less than three, one or more non-significant zero words will be fetched into the N-registers; since the NOUV interrupt is suppressed in Director, no harm results. The round-trip cost of preserving the nest in the short path is therefore just three ERASE orders, taking  $3\mu\text{s}$ . This makes context switching in response to PR interrupts rather efficient, the time consumed between interruption and re-entering a program being of the order of  $320\mu\text{s}$ , or about 60 instructions. The 'long path' through Director, where more complex work is done, must save more of the interrupted program's registers; but that happens relatively infrequently.

If a program causes a NOUV interrupt by overfilling the nest, cells 0-2 of its core stack might be corrupted; but this does not matter, because the program cannot be allowed to continue unless the nest and SJNS are re-initialized.

There is one weakness in the implementation of the nest—the depth is checked only after it changes. A consequence of the reasoning described above is that, for correct LIFO operation, N2 must be clear when the nest depth is 1. But REV, for example, *can* be executed when the depth is only 1, and may swap a non-zero value into N2 without a NOUV. Two pushes later this non-zero value enters the core stack, changing the pristine 0 in cell 2, so that a future write into that cell will be corrupted. However, a program that commits such a gross error is likely to soon attempt a dyadic operation, and will fail at that point. In any case, it affects only itself, and only by making its debugging more difficult.

The implementation of the SJNS is very similar to that of the nest, with a similar division of responsibility between hardware and software. The Jump Buffer, a single flip-flop register analogous to the nest's three N-registers, constitutes the 17th cell of the SJNS. It ensures that interrupts can be taken when the SJNS is full, and it promotes the efficiency of the Director's short path. NOUV is caused after a program pushes a 17th link, and after it pops an empty SJNS. The excess link left in the Jump Buffer by a 17th push is overwritten by the NOUV interrupt's return address. Again, this does not matter, because the program cannot be allowed to continue unless the nest and SJNS are re-initialized.

#### 4.3: DIRECTOR-ONLY REGISTERS AND INSTRUCTIONS

Several special registers, concerned with multiprogramming, are accessible only to Director, although they exercise their influence on the running of problem programs.

The **Base Address** (BA), a 10-bit register, contains the number of the 32-location group of words at which the program starts. It is added to the top ten bits of virtual addresses to effect dynamic relocation. The **Number Of Locations** (NOL), also a 10-bit register, contains the BA-relative group number of the highest addresses the program can validly access. I/O instructions specify virtual addresses in  $Iq$  and  $Mq$ . MC checks that  $Iq \leq Mq$ , checks that both are compatible with NOL, and adds to both the value in BA, before passing them on to IOC. A failed check causes a LIV interrupt. On entry to Director BA is set to 0, so Director runs at the bottom of core. Strangely, NOL is still compared with effective addresses in Director state; Director must set NOL to the maximum for the machine, if it wants to guard against causing a LIV interrupt when accessing main store above its own upper limit.

The **Current Peripheral Device Allocation Register** (CPDAR), a 16-bit register, has one bit for each buffer. If a bit is set, the program can validly command the corresponding buffer; if the bit is clear, any such attempt causes a LIV interrupt. The **Current Priority Level** (CPL), a 2-bit register, holds the hardware dispatching priority of the program.

These registers contain values that are proper to one running program, and Director must set them accordingly before (re-)entering that program. The hardware timesharing option provides four instances of the nest, Q Store and SJNS. On context switching between programs, Director can switch quickly between instances by changing the active set number, but must separately save and restore the nest depth, the SJNS depth, BA, NOL, CPL, CPDAR, and the Overflow and Test registers.

There are four of the PHU (Program Hold-Up) registers. For scheduling reasons, Director might want to change the priority level of a program. Since a program's priority fixes the PHU that is used by the buffers to update its dispatching status, Director must wait for all of its I/O transfers to terminate before assigning it to a different priority.

Instructions to manipulate the special registers are available only in Director state. They are:

- EXITD: clear NIFF and jump to the address in the top cell of the SJNS; interrupts are inhibited throughout the execution of EXITD, to allow the machine to adopt a stable state
- CLOQ $q$ : clear any lock-outs for the area specified by  $Iq$  and  $Mq$ , and clear PHU $n$ , where  $n$  is the current value of CPL
- TLOQ $q$ : test the area specified by  $Iq$  and  $Mq$ , setting the Test Register if any of the encompassed groups are locked out
- CTQ $q$ : terminate device activity and clear lock-outs for the transfer specified by  $Qq$
- =K0: if  $N1 \neq 0$  then switch buzzer on else switch buzzer off end
- =K1: copy bits N1:D24-D33 to NOL, bits N1:D34-D35 to CPL and bits N1:D38-D47 to BA
- =K2: copy bits N1:D32-D47 to CPDAR, where N1:D32 corresponds to buffer 15 and N1:D47 to buffer 0
- =K3: switch to a new Q Store/nest/SJNS set, with N1:D0-D1 as the new register set number, N1:D2-D6 as the nest depth and N1:D7-D11 as the SJNS depth
- K4: push CLOCK/RFIR onto nest.
- K5: push PHUi:D6-D11 onto nest, for  $i$  in 0..3, with PHU0:D6-D11 in N1:D0-D5, PHU1:D6-D11 in N1:D6-D11, PHU2:D6-D11 in N1:D12-D17, and PHU3:D6-D11 in N1:D18-D23.
- K7: push the current register set number and nest depths, as represented for the =K3 instruction.

The =K3 instruction must be followed by at least six DUMMY instructions, since it needs  $6\mu\text{s}$  to take effect and during this period the machine is in an indeterminate state. All =K $k$  instructions *copy* N1, and do not pop it.

Note that register sets are numbered independently of program priority levels. If Director exchanges the priority levels of a pair of programs, it does *not* need to swap the contents of their register sets.

Two other Director-only instructions have to do with I/O: PMGQ $q$  and PMHQ $q$  do not seem to have more specific Usercode mnemonics. The semantics of PMHQ $q$  are clear: it is a 'set lock-out' instruction, the complement of CLOQ $q$ . It is used in Director to lock out an area of core as it would have been by a data transfer in problem program mode.

PMGQq does not seem to be used in any extant Director, although it is attested in marginal notes taken during a KDF9 training course, where it is described as 'read C Store'. Perhaps it is used in I/O diagnostic programs.

## 5: MAIN STORE ORGANISATION

Main store is composed of **modules** of 4K words, each consisting of four 1K **blocks**. A block consists of 48 core planes of  $32 \times 32$  bits (the Z, X and Y directions, respectively), the whole organised as follows.

Registers associated with the main store include the 15-bit main-store address register (MSAR) and the 48-bit main buffer (MSB). The most significant three bits of MSAR give the module number, the next two give the block within the module, the next five give the X-line and the last five the Y line. A series of decoding matrices interpret each of the fields, giving access either to a further matrix, or, eventually, to the X and Y lines themselves.

The read sequence operates as follows:

- 1: the matrix system is pulsed to select the X and Y wires. The cores that flip produce a detected signal in the readout wires, which are organised one per XY plane, per module. The output from the read amplifiers is transferred to MSB. If reading is to be inhibited on certain planes (i.e. for a halfword access), an inhibit current cancels the effect of the X-line current and the cores on those planes do not flip.
- 2: A side effect of the read step is to clear the addressed word, which must therefore be rewritten from MSB. Again the inhibit wires may come into play to safeguard an un-accessed halfword.

The complete read/restore operation is a **core cycle** and takes  $6\mu s$ .

The write sequence is essentially similar. Step 1 prepares the word by setting it to zero. Step 2 ORs-in its new value, the inhibit wires being used as before to prevent unwanted overwriting during halfword accesses.

## 6: MICROPROGRAM SEQUENCING

The sequence unit directing a KDF9 subsystem is a finite-state machine. Each state transition emits microprogram control pulses to initiate primitive hardware micro-operations, such as register-to-register transfers. It also emits a new state value that is fed back to the sequence unit. Control and state outputs may be dependent on input signals, allowing for conditional microinstructions and for microprogram looping.

The MC sequence unit, for example, is similar to that used in AC. It has two  $8 \times 8$  arrays of pulse transformers, and two 6-bit microinstruction registers (MIS1 and MIS2). Activated by a P1 pulse ( $0.25\mu s$ , every  $1\mu s$ ), the contents of MIS1 are decoded to select one of the transformers, which then has its primary winding pulsed: six of its secondary windings respond with outputs that set a successor microinstruction number in MIS2. MIS2 similarly puts a new microinstruction number into MIS1 on a P2 pulse (like P1, but  $0.5\mu s$  later). Alternating between MIS1 and MIS2, the sequence unit steps through a microcode procedure that effects MC's contribution to a KDF9 machine code instruction.

As well as their state outputs, the transformers also generate up to two control outputs that trigger micro-operations within MC. Up to four micro-operations can therefore be commanded in each  $1\mu s$  clock cycle.

Windings can be connected to signals from other parts of the machine, to make output dependent on conditions external to the sequence unit. For example, we can surmise that at least one transformer taking part in the  $Jr < Z$  microcode has at least one output that depends on input from N1:D0 (the sign bit of the number at the top of the nest).

The sequence unit technique is very flexible, and offers many opportunities for context-dependent optimization of the transformer arrays. Within an array a given transformer need only be wired with those signals necessary for the effect wanted. A single transformer can take part in more than one microprogram sequence, with varying effects being conditioned by control inputs connected to it. Arrays can be configured with just enough transformers for their tasks. This all results in significant economies. For example, the sequence unit for a tape punch buffer contains two arrays of only five transformers each, addressed by 3-bit microinstruction number registers.

A basic KDF9, ignoring I/O buffers, uses eight sequence units. MC has one, AC and IOC have three each, and the core store has one to sequence read/write cycles. A fully-specified machine with 16 I/O buffers has no fewer than 24.

## 7: MAIN CONTROL

The instruction currently being obeyed by MC is held in a 27-bit register called the **instruction staticizer** (INS). INS contains three whole syllables, the first two bits of the following syllable and the first bit of the syllable after that. (AC on the other hand only examines the first syllable of any instruction and so has a 1-syllable staticizer called AINS.) The outputs from INS are used to feed a decoder matrix developing 46 'function levels' which initialize the MIS1 register in the MC sequence unit. That is, they specify entry to different points of the MC sequence unit's microprogram; these are the 'S' numbers listed in Appendixes 1 and 2.

MC is responsible for all two- and three-syllable instructions. Therefore it is responsible for initiating main store cycles, for Q Store operations, for initiating transfers to and from nest, for setting up jumps, for maintaining the SJNS, for initiating I/O operations, and for dealing with interrupts. Instructions that do not involve the nest are implemented entirely by MC, with no participation by AC. Instructions that do involve the nest are executed co-operatively by MC and AC, using buffer and interlock registers to synchronize their actions. These registers are many and various; only the most important are identified in the following.

To implement a write into main store, MC copies the destination address to the Store Buffer Address Register (SBAR); the cycle is initiated after the Store Buffer (SB) is set with a value by AC, at a time when MC next encounters a main store operation.

To implement a read from main store, MC fills MSAR with the source address. If MSAR = SBAR at that point, SB is immediately copied to MSB, thus eliminating a redundant core cycle; otherwise MC initiates a read cycle. When the cycle completes, MC transfers the word from MSB to whichever of the two fetch buffers (FB0, FB1) is free, from whence AC takes the operand for the nest. If both fetch buffers are in use, MC waits for AC to free one of them.

As part of a main store read or write operation, MC also checks for LIV and LOV, in parallel with the actual core cycle if possible. If either condition obtains, the transfer of data is abandoned and the interrupt sequence begins.

To implement an operation on the Q Store, analogous arrangements are made, involving the Q Store Buffer (QB), the Address of item in QB (AQB), and the Q Store Address Register (QSAR). In the case of shift instructions, the shift amount is taken from INS or from Cq as necessary, and placed in QB for transfer to AC and thence to Shift Control.

A modicum of complexity is introduced into SJNS operations to avoid delays due to updating the SJNS depth counter. The trick is to have two of them, so that the SJNS core stack is addressed alternately by the SJNS Even Address Register (JEVAR) for even-numbered locations, and the SJNS Odd Address Register (JODAR) for odd-

numbered locations. During an SJNS operation one counter is used as the current stack pointer while the other is concurrently incremented or decremented to become the new stack pointer. The JTOG flip-flop indicates which of the two stack pointers is to be used. It is not clear why this was thought worthwhile for the SJNS but not the nest.

MC's other great responsibility is instruction fetching. Address registers involved include the Next Instruction-Word Address (NIWA); the Next Instruction (syllable) Address (NIA); and the Current Instruction (syllable) Address (CIA). AC's equivalent to NIA is ANIA and ADIA is its equivalent to CIA. NIWA, NIA, CIA, ANIA and ADIA jointly serve the role of a 'program counter' register in a more conventional architecture.

There are two Instruction-Word Buffers (IWB0 and IWB1), each being a 48-bit flip flop register. MC transfers words containing instructions to whichever of the IWBs is not currently being inspected by AC. CIA is a 4-bit (1+3) register containing the IWB number and the syllable number of the first syllable of the instruction that MC is currently obeying. NIA is like CIA, but holds the address, within the IWBs, of the first syllable of the instruction MC is next to consider. The three extra bits in INS are used by the logic that increments CIA to get the value of NIA. The absolute value of (NIA-CIA) is less than or equal to four, so MC can skip up to four syllables of AC-only instructions within the same IWB, in zero time. If NIA points to the 'other' IWB from CIA, MC is forced to consider the next syllable on from the end of the present instruction, even if it is AC-only, for a  $1\mu s$  penalty.

An instruction word is transferred to a free IWB according to the following logic:

- If NIA points to the 'other' buffer from CIA, i.e. (NIA:D0  $\neq$  CIA:D0), and ANIA points to the same buffer as CIA, i.e. (ANIA:D0 = CIA:D0).
- If (NIA:D0 = ANIA:D0)  $\wedge$  (NIA:D0  $\neq$  CIA:D0) then AC is lagging more than six syllables behind, so MC is held up until AC exits its present IWB.
- If (NIA:D1-D3 > 3)  $\wedge$  (NIA:D0 = CIA:D0), and the syllable addressed by NIA indicates that the next instruction extends beyond the end of the current IWB, then a new word is fetched into the IWB given by  $\neg$ NIA:D0.

Instruction fetching is disabled by a flip-flop called SPIN. If MC finds SPIN set at the start of its cycle, it leaves it set at the end, only unsetting SPIN if it got set during the cycle. This mechanism is used to prevent instruction fetching by the JrCqNZS short-loop instruction, which keeps SPIN set until  $Cq = 0$ .

The *physical* address of the instruction word to be fetched is held in NIWA, a 13-bit flip-flop register. On unconditional jumps, or successful conditional jumps, the target word's address is converted from virtual to physical form, and the result written to NIWA; its syllable address is copied from INS to NIA. On completion of the instruction fetch cycle, an IWB is filled from MSB.

The target syllable number of most jumps is constant, and held in the first syllable of the instruction where it is readily accessible to AC. EXIT and EXITD are exceptions—their target addresses are dynamically determined; so their target IWB-number and syllable-number are made available to AC in the JAB register.

Conditional jumps fall into two categories: those that are dependent on values in the nest and those that are not. The latter are implemented entirely in MC. The former require MC to synchronize with AC, which computes the value of the jump condition, passes it to MC, and waits. MC steers instruction fetching accordingly; when it has completely finished with the jump, AC is allowed to continue.

RFIR is inspected from time to time by MC, depending on the instruction being executed. It is not inspected during any of the following, so an interrupt cannot occur in any of those places: an *unsuccessful* conditional jump; shift instructions; Q Store to Q Store transfers; =LINK; OUT; =Qq, etc; and all single-syllable instructions. Interrupts are inhibited throughout the execution of the EXITD instruction, to allow the machine to adopt a stable state.

## 8: ARITHMETIC CONTROL

AC obeys 1-syllable instructions and looks at the first syllable of multi-syllable instructions, some of which require AC's participation. The main sequence unit of AC coordinates its activities with MC, and implements simple orders such as nest manipulation, fixed-point addition and subtraction, and logical operations on bit patterns. Shifts are delegated to Shift Control, and multiplication and division to a dedicated Multiplier/Divider. The latter both have their own sequence unit which acts, in effect, as a subroutine of AC.

## 9: TIMING INSTRUCTION SEQUENCES

It is difficult to give a definitive execution time for a sequence of KDF9 instructions, because it depends on many dynamically-determined interlock conditions, and these may be influenced by the preceding instruction sequence. The following summarizes the main issues.

- AC does not begin executing operations which MC has not completed.
- AC can execute instructions only after MC has finished with them, but can inspect them earlier.
- AC takes only  $1\mu s$  to deal with a DUMMY instruction, or one it treats as a DUMMY: these are M $\pm$ Iq, NCq, DCq, Iq= $\pm 1$ , Iq= $\pm 2$ , Q-to-Q transfers, =Kk, and all I/O instructions.
- AC halts MC when it wants to use the Q Store or the SJNS. More precisely, AC sets NOG and waits for MC to stop, clearing NOG, which cues AC to complete the transfer and restart MC.
- MC waits for AC to compute the condition in a nest-dependent conditional jump; AC then waits until MC has completed the jump.
- MC does not attempt a jump if the JAB interlock is set.
- MC does not initiate a shift if the shift interlock is set.
- MC does not replace the contents of an IWB until AC has finished with it.
- MC does not action store write cycles until SBARM is clear.
- MC does not deal with an instruction of the =Qq class if AC has not dealt with the last such instruction. MC sets AQBM on initiating Q Store housekeeping, and waits at the next such command until AC clears it.
- MC does not action an SJNS instruction until the SJNS interlock is clear; it is set only by =LINK or LINK.
- MC copies previously-fetched data from SB if (SBAR = MSAR)  $\wedge$  SBARM.
- MC copies previously-fetched data from QB if (AQB = QSAR)  $\wedge$  AQBM.
- MC takes only  $1\mu s$  to dismiss a 1-syllable instruction it is forced to consider.
- MC waits for AC to catch up before entering an interrupt sequence. Interrupts are requested by setting SKIN, which stops MC at the end of the current instruction. AC stops when it catches up, and the interrupt is effected as soon as

$\text{SKIN} \wedge (\text{NIA} = \text{ANIA}) \wedge \neg \text{NIFF}$ . This has the curious consequence that an empty-nest NOUV may not be serviced until the nest has been refilled by subsequent AC-only instructions. The link saved on a NOUV interrupt is the address at which NIA equals ANIA, which may be several syllables past the failing instruction.

**EXAMPLE 1—TO COMPUTE THE DOUBLE-PRECISION SCALAR PRODUCT  $\sum x_i y_i$**

On entry to the loop,  $Q1 = n/1/0$ , where  $n$  is the length of the vectors; the vector  $x$  is in the variables  $YX1\dots YXn$ ,  $y$  is in the variables  $YY1\dots YYn$ ; and the nest contains a pair of zeros. The  $x+F$  instruction is a ‘multiply and accumulate’ operation: it forms the double-precision product  $N1 \times N2$  and adds that to the double-precision sum in  $N3$  and  $N4$ .

```
*1;  YY1M1; YY1M1Q; x+F;
    J1C1NZS;
```

| $\mu s$ in MC               | Instruction  | $\mu s$ in AC               |
|-----------------------------|--------------|-----------------------------|
|                             | *1           |                             |
| 6                           | YY1M1        | 2                           |
| 7                           | YY1M1Q       | 2                           |
| 0                           | x+F          | 19                          |
| 4                           | J1C1NZS      | 2                           |
| <b>17<math>\mu s</math></b> | <b>TOTAL</b> | <b>25<math>\mu s</math></b> |

The first pass through takes  $36\mu s$ : AC must wait for MC to finish each of the first two fetches, and MC must set up the short-loop mode when it first encounters the J1C1NZS instruction. Subsequent iterations of the loop take only  $25\mu s$ : the time in MC is completely overlapped by the time AC takes in the  $x+F$  instruction. Note that a faster core store would not improve the KDF9 time significantly.

The Ferranti Atlas 2, a contemporary of KDF9, allowed some overlap between integer and floating-point units, but otherwise sequenced instructions conventionally. It had much faster floating-point arithmetic than KDF9 and its scalar product loop took from  $11.9\mu s$  (with  $2.5\mu s$  core store) to  $25.9\mu s$  (with  $5\mu s$  core store).

Using the value of  $\text{loop time} \div \text{core cycle time}$  as a measure of an instruction set’s efficiency, KDF9’s architecture is between 15% and 20% better in this important algorithm.

**EXAMPLE 2—TO COMPUTE THE POLYNOMIAL  $\sum a_i x^i$**

On entry to the routine N1 contains  $n$ , the order of the polynomial; the value  $x$  is in N2; and the vector of coefficients  $a$  is in the variables YA0…YAn.

```
DUP; =C1; =M1; I1=-1; =Q2; YA0M1;
    J2C1Z;
    DC1; M+I1;
*1;   Q2; xF; YA0M1Q; +F;
    J1C1NZS;
2; EXIT 1;
```

Again, the first iteration of the loop takes longer due to once-only setup time. Subsequent iterations of the loop take  $28\mu s$ , the time in MC being completely overlapped by the floating-point arithmetic. In this case the comparison with Atlas 2 is less favourable to KDF9. Atlas 2 took from  $7.4\mu s$  to  $13.7\mu s$  per iteration, its advantage being primarily in the floating-point arithmetic, which is between  $12\mu s$  and  $15\mu s$  faster.

It is interesting to compare KDF9 with other computers of its era. The following table uses data from [Longbottom]. Consider the Burroughs 5000 and the KDF9. They have the same main store cycle time and word length, but KDF9 is nearly three times as fast. Another way of putting it is that KDF9 needs about one store cycle time on average for each mix ‘instruction’ executed, while the B5000 needs nearly three.

| <b>KDF9</b>           | <b>6<math>\mu s</math></b>  | <b>170 KIPS</b>              | <b>0.98</b>                          |
|-----------------------|-----------------------------|------------------------------|--------------------------------------|
| ATLAS 1               | 2 $\mu s$                   | 350 KIPS                     | 1.43                                 |
| Honeywell 1800        | 2 $\mu s$                   | 295 KIPS                     | 1.69                                 |
| <b>Burroughs 5500</b> | <b>4<math>\mu s</math></b>  | <b>144 KIPS</b>              | <b>1.72</b>                          |
| UNIVAC 1107           | 4 $\mu s$                   | 131 KIPS                     | 1.92                                 |
| CDC 3600              | 1.5 $\mu s$                 | 337 KIPS                     | 1.96                                 |
| <b>Burroughs 5000</b> | <b>6<math>\mu s</math></b>  | <b>60 KIPS</b>               | <b>2.78</b>                          |
|                       | <b>store<br/>cycle time</b> | <b>Gibson mix<br/>orders</b> | <b>cycle times<br/>per mix order</b> |

The Whetstone benchmark, from reported figures, runs at 178KIPS. Execution on an emulator, **ee9** [FindlayUG], of a group theory program due to John Leech gives a rate of 184KIPS. These numbers indicate that KDF9’s actual speed is close to or better than its Gibson mix rating.

A significant contribution to KDF9’s excellent performance is the concurrent instruction fetching carried out by MC while AC is obeying orders in the ‘other’ IWB. Until recently it was not clear how well this worked, but investigations with **ee9** show that about 40% of the instruction fetch time is overlapped when running the Whetstone Benchmark with the Whetstone interpreter, and about 36% overlap is achieved when running it with the Kidsgrove compiler.

For more context on the performance of the KDF9, see [FindlayBM].

## APPENDIX 1: KDF9 INSTRUCTION SET AND EXECUTION TIMES

Waits caused by busy-resource interlocks between AC and MC are not included in these times, which are given in  $\mu$ s.

Annotations of the form ' $(x\mu s)$ ' mean that the entailed core cycle begins  $x \mu$ s after MC starts executing the instruction. '\*' means that AC must catch up with MC before MC starts executing the order; AC then waits for MC to finish before executing the order itself. '§' means that AC waits for MC to stop, taking at least 2  $\mu$ s, before it starts executing the order; MC then waits for AC to finish before restarting.

The types of operands are declared using the following indicators: D: 96-bit fixed-point; DF: 96-bit floating-point; F: 48-bit float; FH: 24-bit float; H: 24-bit fixed; I: 48-bit fixed; W: 48 bit non-numerical word. Popping the nest into an operand  $x$  is denoted  $\uparrow x$ ; pushing  $x$  on to the nest is denoted  $\downarrow x$ ; popping the SJNS is denoted  $\uparrow\uparrow$ ; pushing the SJNS is denoted  $\downarrow\downarrow$ . As an operand,  $\uparrow$  denotes the popped top of the nest, and  $\uparrow\uparrow$  denotes the popped top of the SJNS. Only net observable effects are shown; no attempt is made to document the actual microcode sequence that implements an order. See [EEC62a] for more complete information on microcode sequences and instruction timing.

|      | <b>Jumps</b>                                               | <b>AC</b> | <b>MC</b>                                                                      | <b>Effect</b>                                                                                                               |
|------|------------------------------------------------------------|-----------|--------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------|
| S1   | JS $r$                                                     | 2         | 11 (4.5 $\mu$ s)                                                               | Jump to subroutine: $\downarrow\downarrow$ NIA                                                                              |
| S2   | JrC $q$ Z, JrC $q$ NZ                                      | 2         | 4 unsuccessful;<br>11 (4.5 $\mu$ s) successful                                 | Jump if C $q$ is (Not) Zero                                                                                                 |
| S3   | Jr                                                         | 2         | 8 (1.5 $\mu$ s)                                                                | Jump unconditionally                                                                                                        |
| S4   | JrC $q$ NZS                                                | 2         | 4 unsuccessful;<br>11 (4.5 $\mu$ s) successful first time, but only 4 normally | Jump if C $q$ $\neq$ 0 Special {to the start of the preceding word in the instruction buffer, and not reloading the latter} |
| S5*  | Jr=, Jr $\neq$                                             | 2         | 5 unsuccessful;<br>12 (5.5 $\mu$ s) successful                                 | Jump if $\uparrow=N2$ , $\uparrow\neq N2$ , respectively                                                                    |
| S6*  | Jr=Z, Jr $\neq$ Z, Jr<Z,<br>Jr $\geq$ Z, Jr>Z, Jr $\leq$ Z | 2         | 4 unsuccessful;<br>11 (4.5 $\mu$ s) successful                                 | Jump if $\uparrow=0$ , $\uparrow\neq 0$ , etc;                                                                              |
| S7*  | JrEN, JrNEN, JrEJ,<br>JrNEJ, JrTR,<br>JrNTR,<br>JrV, JrNV  | 2         | 3 unsuccessful;<br>10 successful (3.5 $\mu$ s)                                 | Jump if (Not) Empty NEST, (Not) Empty SJNS,<br>(Not) Test Register,<br>(Not) Overflow                                       |
| S8*  | EXIT offset                                                | 2         | 12 (5.5 $\mu$ s), but 13 if offset<br>is an odd number                         | NIA := (offset + $\uparrow\uparrow$ )<br>{return from subroutine; also switch to case}                                      |
| S9*  | EXITD                                                      | 2         | 12 (5.5 $\mu$ s)                                                               | $\uparrow\uparrow$ NIA; NIFF := 0 {Director-only: enter program}                                                            |
| S10  | LINK                                                       | 2         | 4                                                                              | $\downarrow\uparrow$                                                                                                        |
| S11§ | =LINK                                                      | 2         | 3                                                                              | $\downarrow\uparrow$                                                                                                        |

### KEY:

$r$  is an instruction label (an integer in Usercode);  $k, q$  are Q Store numbers in the range 0..15

|     | <b>Q Store</b> | <b>AC</b> | <b>MC</b> | <b>Effect</b>                   |
|-----|----------------|-----------|-----------|---------------------------------|
| S12 | M+I $q$        | 1         | 4         | M $q$ := M $q$ + I $q$          |
| S13 | M-I $q$        | 1         | 5         | M $q$ := M $q$ - I $q$          |
| S14 | NC $q$         | 1         | 5         | C $q$ := - C $q$                |
| S15 | DC $q$         | 1         | 3         | C $q$ := C $q$ - 1              |
| S16 | I $q$ =1       | 1         | 3         | I $q$ := 1                      |
| S17 | I $q$ =-1      | 1         | 3         | I $q$ := -1                     |
| S18 | I $q$ =2       | 1         | 3         | I $q$ := 2                      |
| S19 | I $q$ =-2      | 1         | 3         | I $q$ := -2                     |
| S20 | Q $k$ TOQ $q$  | 1         | 4         | Q $q$ := Q $k$                  |
| S20 | C $k$ TOQ $q$  | 1         | 4         | C $q$ := C $k$                  |
| S20 | I $k$ TOQ $q$  | 1         | 4         | I $q$ := I $k$                  |
| S20 | M $k$ TOQ $q$  | 1         | 4         | M $q$ := M $k$                  |
| S20 | IM $k$ TOQ $q$ | 1         | 4         | I $q$ := I $k$ ; M $q$ := M $k$ |
| S20 | CM $k$ TOQ $q$ | 1         | 4         | C $q$ := C $k$ ; M $q$ := M $k$ |
| S20 | CI $k$ TOQ $q$ | 1         | 4         | C $q$ := C $k$ ; I $q$ := I $p$ |
| S21 | Q $q$          | 2         | 4         | $\downarrow Qq$                 |
| S22 | C $q$          | 2         | 5         | $\downarrow Cq$                 |
| S23 | I $q$          | 2         | 6         | $\downarrow Iq$                 |
| S24 | M $q$          | 2         | 4         | $\downarrow Mq$                 |

|     | <b>SET Instruction</b> | <b>AC</b> | <b>MC</b> | <b>Effect</b>                |
|-----|------------------------|-----------|-----------|------------------------------|
| S25 | SET <i>i</i>           | 2         | 4         | <i>i</i> : I; $\downarrow i$ |

**KEY:**

*i* is a constant in the range -32768..+32767

|       | <b>Q-store</b>   | <b>AC</b> | <b>MC</b> | <b>Effect</b>                           |
|-------|------------------|-----------|-----------|-----------------------------------------|
| S26 § | =Q <sub>q</sub>  | 2         | 2         | $\uparrow Qq$                           |
| S26 § | =C <sub>q</sub>  | 2         | 2         | $\uparrow Cq$                           |
| S26 § | =I <sub>q</sub>  | 2         | 2         | $\uparrow Iq$                           |
| S26 § | =M <sub>q</sub>  | 2         | 2         | $\uparrow Mq$                           |
| S27 § | =RC <sub>q</sub> | 2         | 3         | Reset <i>Qq</i> to 0/1/0; $\uparrow Cq$ |
| S27 § | =RI <sub>q</sub> | 2         | 3         | Reset <i>Qq</i> to 0/1/0; $\uparrow Iq$ |
| S27 § | =RM <sub>q</sub> | 2         | 3         | Reset <i>Qq</i> to 0/1/0; $\uparrow Mq$ |
| S28 § | =+Q <sub>q</sub> | 5         | 5         | $\downarrow Qq$ ; +; $\uparrow Qq$      |
| S29 § | =+C <sub>q</sub> | 5         | 6         | $\downarrow Cq$ ; +; $\uparrow Cq$      |
| S30 § | =+I <sub>q</sub> | 5         | 7         | $\downarrow Iq$ ; +; $\uparrow Iq$      |
| S31 § | =+M <sub>q</sub> | 5         | 5         | $\downarrow Mq$ ; +; $\uparrow Mq$      |

|     | <b>Shifts</b>                                                                                                                     | <b>AC</b>       | <b>MC</b> | <b>Effect</b>                                                                                                                                                                                         |
|-----|-----------------------------------------------------------------------------------------------------------------------------------|-----------------|-----------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| S32 | SHA $\pm n$ , SHAD $\pm n$ , SHL $\pm n$ , SHLD $\pm n$<br>where <i>n</i> is in the range -64 .. +63                              | 3+ <i>t</i>     | 2         | Shift top of NEST <i>n</i> bits left (+) or right (-), Arithmetic (N1), Arithmetic Double-length (N1:N2), Logical (N1), Logical Double-length (N1:N2). SHA- <i>n</i> rounds!                          |
| S32 | SHC $\pm n$<br>where <i>n</i> is in the range -48 .. +48                                                                          | 4+ <i>t</i>     | 2         | Shift N1 <i>n</i> bits left (+) or right (-), Cyclic                                                                                                                                                  |
| S32 | $\times +n$<br>where <i>n</i> is in the range -64 .. +63                                                                          | 16+<br><i>t</i> | 2         | <i>r, l</i> : I; <i>x, s</i> : D;<br>$\uparrow r, l; \uparrow s;$<br>$\downarrow x = (l \times r) \times 2^n + s$                                                                                     |
| S33 | SHAC <sub>q</sub> , SHADC <sub>q</sub> , SHLC <sub>q</sub> , SHLD <sub>q</sub><br>where C <sub>q</sub> is in the range -96 .. +96 | 3+ <i>t</i>     | 3         | Shift NEST C <sub>q</sub> bits left (+) or right (-), Arithmetic (N1), Arithmetic Double-length (N1:N2), Logical (N1), Logical Double-length (N1:N2). SHAC <sub>q</sub> rounds if C <sub>q</sub> < 0! |
| S33 | SHCC <sub>q</sub><br>where C <sub>q</sub> is in the range -48 .. +48                                                              | 4+ <i>t</i>     | 3         | Shift N1 C <sub>q</sub> bits left (+) or right (-), Cyclic                                                                                                                                            |
| S33 | $\times +Cq$<br>where C <sub>q</sub> is in the range -96 .. +96                                                                   | 16+<br><i>t</i> | 3         | <i>r, l</i> : I; <i>x, s</i> : D;<br>$\uparrow r, l; \uparrow s;$<br>$\downarrow x = (l \times r) \times 2^{Cq} + s$                                                                                  |

**KEY:**

*t* is shifting time in excess of 1 $\mu$ s, given by  $t = \lceil a/2 \rceil + (\text{if } b \neq 0 \text{ then } 1 \text{ else } 0) - 1$ , such that the absolute value of the shift length  $n = 8a + b$ ,  $0 \leq a < 12$ ,  $0 \leq b < 8$ .

|       | <b>Privileged</b> | <b>AC</b> | <b>MC</b> | <b>Effect</b>                                                                                  |
|-------|-------------------|-----------|-----------|------------------------------------------------------------------------------------------------|
| S34 * | =K0               | 1         | 3         | <b>if</b> N1 $\neq 0$ <b>then</b> switch on buzzer<br><b>else</b> switch off buzzer <b>end</b> |
|       | =K1               | 1         | 3         | Copy N1 to BA, CPL and NOL                                                                     |
|       | =K2               | 1         | 3         | Copy N1 to CPDAR                                                                               |
|       | =K3               | 1         | 3         | Copy N1 to nest set counters and register set number                                           |
| S35 * | K4                | 2         | 3         | $\downarrow$ RFIR and Clock                                                                    |
|       | K5                | 2         | 3         | $\downarrow$ PHU registers 0-3                                                                 |
|       | K7                | 2         | 3         | $\downarrow$ nest set counters and register set number                                         |

The K<sub>k</sub> registers can be accessed only in Director mode, as uncontrolled access may compromise system integrity.

| Fetch and Store |                                                                                                                                                                                                                                                                                                                | AC | MC                           | Effect                                                                                                                                                        |
|-----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------|
| S36             | Direct Fetch:<br>EaM <sub>q</sub> , EaM <sub>q</sub> Q                                                                                                                                                                                                                                                         | 2  | 6 (3.5 $\mu$ s);<br>7 with Q | $\downarrow [a+Mq]$<br>With Q:<br>$Mq := Mq + Iq; Cq := Cq - 1$                                                                                               |
| S37             | Indirect Fetch:<br>M <sub>k</sub> M <sub>q</sub> , M <sub>k</sub> M <sub>q</sub> Q,<br>M <sub>k</sub> M <sub>q</sub> N, M <sub>k</sub> M <sub>q</sub> QN,<br>M <sub>k</sub> M <sub>q</sub> H, M <sub>k</sub> M <sub>q</sub> QH,<br>M <sub>k</sub> M <sub>q</sub> HN, M <sub>k</sub> M <sub>q</sub> QHN         | 2  | 7 (4.5 $\mu$ s);<br>8 with Q | $\downarrow [Mk+Mq]$<br>With Q:<br>$Mq := Mq + Iq; Cq := Cq - 1$<br>Halfword M <sub>q</sub> with H,<br>[(1+M <sub>k</sub> )+M <sub>q</sub> ] accessed with N. |
| S38             | Direct Store:<br>= EaM <sub>q</sub> , = EaM <sub>q</sub> Q                                                                                                                                                                                                                                                     | 1  | 6 (1.5 $\mu$ s);<br>7 with Q | $\uparrow [a+Mq]$<br>With Q:<br>$Mq := Mq + Iq; Cq := Cq - 1$                                                                                                 |
| S39             | Indirect Store:<br>=M <sub>k</sub> M <sub>q</sub> , =M <sub>k</sub> M <sub>q</sub> Q,<br>=M <sub>k</sub> M <sub>q</sub> N, =M <sub>k</sub> M <sub>q</sub> QN,<br>=M <sub>k</sub> M <sub>q</sub> H, =M <sub>k</sub> M <sub>q</sub> QH,<br>=M <sub>k</sub> M <sub>q</sub> HN, =M <sub>k</sub> M <sub>q</sub> QHN | 1  | 7 (1.5 $\mu$ s);<br>8 with Q | $\uparrow [Mk + Mq]$<br>With Q:<br>$Mq := Mq + Iq; Cq := Cq - 1$<br>Halfword M <sub>q</sub> with H,<br>[(1+M <sub>k</sub> )+M <sub>q</sub> ] accessed with N. |

**KEY:**

*a* is a main store address

| I/O |                   | AC | MC                                                                                                                                                                                                                                                                                                                                                                                                 |
|-----|-------------------|----|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| S40 | Peripheral Read   | 1  | TLOQ <sub>q</sub> :                                                                                                                                                                                                                                                                                                                                                                                |
| S41 | Peripheral Write  | 1  | 15+ <i>t</i><br>PMHQ <sub>q</sub> , CLOQ <sub>q</sub> :<br>16+ <i>t</i><br>PIxQ <sub>q</sub> , POxQ <sub>q</sub> :<br>22+ <i>t</i> , but:<br>15 if unallocated (LIV)<br>17 if device busy<br>18 if uncleared parity error<br>20 if LOV interrupt                                                                                                                                                   |
| S42 | Peripheral Gap    | 1  | MGAPQ <sub>q</sub> /POEQ <sub>q</sub> , MWIPEQ <sub>q</sub> /POFQ <sub>q</sub> :<br>19, but:<br>14 if unallocated (LIV),<br>16 if device busy<br>17 if uncleared parity error<br>19 if LOV interrupt                                                                                                                                                                                               |
| S43 | Peripheral Skip   | 1  | MFSKQ <sub>q</sub> /PMAQ <sub>q</sub> , MRWDQ <sub>q</sub> /PMDQ <sub>q</sub> ,<br>MBSKQ <sub>q</sub> /PMEQ <sub>q</sub> :<br>as S42                                                                                                                                                                                                                                                               |
| S44 | Peripheral Status | 1  | INTQ <sub>q</sub> :<br>12, but:<br>11 if unallocated (LIV)<br>13 if device busy<br>BUSYQ <sub>q</sub> , CTQ <sub>q</sub> :<br>13, but:<br>11 if unallocated (LIV)<br>13 if device busy<br>PARQ <sub>q</sub> , MBTQ <sub>q</sub> /PMBQ <sub>q</sub> ,<br>MLBQ <sub>q</sub> /PMCCQ <sub>q</sub> , METQ <sub>q</sub> /PMFQ <sub>q</sub> :<br>14, but:<br>11 if unallocated (LIV)<br>13 if device busy |

**KEY:**

*t* =  $\lceil (Mq - Iq) \div 32 \rceil$  is the time taken setting the lock-out store;

and PIxQ<sub>q</sub>, POxQ<sub>q</sub> take an additional 6 $\mu$ s core cycle per character or word, depending on the I/O device type

| <b>I/O</b>   | <b>Description</b>                                                                                  |
|--------------|-----------------------------------------------------------------------------------------------------|
| S40 TLOQq    | Test Lock-Out                                                                                       |
| S40 CLOQq    | Clear Lock-Out (Director-only)                                                                      |
| S40 PIAQq    | Read Forward                                                                                        |
| S40 PIBQq    | Read Forward to End-Message                                                                         |
| S40 PICQq    | Read {Character (Paper Tape, Cards), from Fixed Heads (Disc)}                                       |
| S40 PIDQq    | Read {Character (Paper Tape, Cards), from Fixed Heads (Disc)} to End-Message                        |
| S40 PIEQq    | Read {Backward (Mag. Tape), Alphanumeric (Cards), Next Sector (Disc)}                               |
| S40 PIFQq    | Read {Backward (Mag. Tape), Alphanumeric (Cards); Next Sector (Disc)} to End-Message                |
| ??? PIGQq    | Read {Alphanumeric Character (Cards), Next Sector from Fixed Heads (Disc)}                          |
| ??? PIHQq    | Read {Alphanumeric Character (Cards), Next Sector from Fixed Heads (Disc)} to End-Message           |
| S41 POAQq    | Write                                                                                               |
| S41 POBQq    | Write to End-Message                                                                                |
| S41 POCQq    | Write {Last block (Mag. Tape), Character (Paper Tape, Cards), to Fixed Heads (Disc)}                |
| S41 PODQq    | Write {Last block (Mag. Tape), Character (Paper Tape, Cards), to Fixed Heads (Disc)} to End-Message |
| S42 POEQq    | Write gap on output (magnetic- or paper-) tape                                                      |
| S42 POFQq    | Wipe long gap on magnetic tape                                                                      |
| ??? POGQq    | Write {Alphanumeric (Cards), Next Sector (Disc)}                                                    |
| ??? POHQq    | Write {Alphanumeric (Cards), Write Sector (Disc)} to End-Message                                    |
| ??? POKQq    | Write {Alphanumeric Character (Cards), Next Sector to Fixed Heads (Disc)} to End-Message            |
| ??? POLQq    | Write {Alphanumeric Character (Cards), Next Sector to Fixed Heads (Disc)}                           |
| S43 INTQq    | Interrupt if device is busy (Director Entry) — may be S44?                                          |
| ?            |                                                                                                     |
| S43 PMAQq    | Forward Skip (Mag. Tape), Seek (Disc)                                                               |
| S43 PMDQq    | Rewind (Mag. Tape), Home Heads (Disc)                                                               |
| S43 PMEQq    | Backward Skip (Mag. Tape)                                                                           |
| S44 CTQq     | Clear any extant Transfer and set device unready/offline (Director-only)                            |
| S44 MANUALQq | Set device offline/unready                                                                          |
| S44 PARQq    | Test for Parity-Check or other device error                                                         |
| S44 BUSYQq   | Test for Busy device                                                                                |
| S44 PMBQq    | Test for Beginning of Tape window (Mag. Tape)                                                       |
| S44 PMCQq    | Test for Last Block                                                                                 |
| S44 PMFQq    | Test for End of Tape Warning (Mag. Tape), Test for End of Area (Disc)                               |
| ??? PMGQq    | Read C-Store (Director-only)                                                                        |
| ??? PMHQq    | Set Lock-Out (Director-only)                                                                        |
| ??? PMKQq    | Forward Skip, even parity, on IBM tape                                                              |
| ??? PMLQq    | Backward Skip, even parity, on IBM tape                                                             |

| <b>Interrupts</b> | <b>AC</b> | <b>MC</b>       | <b>Effect</b>                                       |
|-------------------|-----------|-----------------|-----------------------------------------------------|
| S45 OUT           | 2         | 3 (2.5 $\mu$ s) | System-call: $\downarrow$ CIA; enter Director state |
| LOV               | 2         | 3 (2.5 $\mu$ s) | $\downarrow$ CIA; enter Director state              |
| Others            | 2         | 3 (2.5 $\mu$ s) | $\downarrow$ NIA; enter Director state              |

| 'arithmetic', all S46 | AC       | Effect {MC takes 0μs, or 1μs if too far ahead of AC}                                                                                       |
|-----------------------|----------|--------------------------------------------------------------------------------------------------------------------------------------------|
| +, -                  | 1        | $r, l : I; \uparrow r, l; \downarrow l \pm r$                                                                                              |
| ABS                   | 1        | $r : I; \uparrow r; \downarrow  r $                                                                                                        |
| 0, DUMMY              | 1        | 'No-op'                                                                                                                                    |
| ERASE, REV            | 1        | Reorder top of nest: see §2.3.                                                                                                             |
| NEG                   | 1        | $r : I; \uparrow r; \downarrow -r$                                                                                                         |
| NOT                   | 1        | $r : W; \uparrow r; \downarrow \neg r$                                                                                                     |
| AND, OR               | 1        | $r, l : W; \uparrow r, l; \downarrow l \wedge r \{ \text{but for OR: } \downarrow l \vee r \}$                                             |
| ROUND                 | 1        | $r : D; \uparrow r; \downarrow (r \text{ rounded to } I)$                                                                                  |
| VR                    | 1        | Clear Overflow                                                                                                                             |
| =TR                   | 2        | $\uparrow l; \text{if } l < 0 \text{ then set Test Register end}$                                                                          |
| CONT                  | 2        | $r : D; \uparrow r; \downarrow (r \text{ contracted to } I)$                                                                               |
| DUP, CAB, PERM, ZERO  | 2        | Reorder top of nest: see §2.3.                                                                                                             |
| NEGD                  | 2        | $r : D; \uparrow r; \downarrow -r$                                                                                                         |
| NEV                   | 2        | $r, l : W; \uparrow r, l; \downarrow l \neq r \{ \text{Not Equivalent, i.e., exclusive or} \}$                                             |
| ROUNDH                | 2        | $r : I; \uparrow r; \downarrow (r \text{ rounded to } H) \{ \text{D24-D47 of result are undefined} \}$                                     |
| +D, -D                | 3        | $r, l : D; \uparrow r, l; \downarrow l \pm r$                                                                                              |
| NEGF                  | 3        | $r : F; \uparrow r; \downarrow -r$                                                                                                         |
| ROUNDFF               | 3        | $r : DF; \uparrow r; \downarrow (r \text{ rounded to } F)$                                                                                 |
| ROUNDHF               | 3        | $r : F; \uparrow r; \downarrow (r \text{ rounded to } FH) \{ \text{D24-D47 of result are undefined} \}$                                    |
| SIGN                  | 3        | $r, l : I; \uparrow r, l; \downarrow \text{if } l > r \text{ then } +1 \text{ elseif } l < r \text{ then } -1 \text{ else } 0 \text{ end}$ |
| STR                   | 3        | $r : I; \uparrow r; \downarrow (r \text{ stretched to } D): D0-D47 of result = r:D0, D48 = 0$                                              |
| ABSF                  | 4        | $r : F; \uparrow r; \downarrow  r  \{ \text{ABSF takes only } 1\mu\text{s if } r \geq 0 \}$                                                |
| DUPD, REVD            | 4        | Reorder top of nest: see §2.3.                                                                                                             |
| MAX                   | 4        | $r, l : I; \uparrow r, l; \text{if } l \geq r \text{ then } \downarrow r, l; \text{ set Overflow else } \downarrow l, r \text{ end}$       |
| SIGNF                 | 5        | $r, l : F; \uparrow r, l; \downarrow \text{if } l > r \text{ then } +1 \text{ elseif } l < r \text{ then } -1 \text{ else } 0 \text{ end}$ |
| STAND                 | 5+n      | $r : F; \uparrow r; \downarrow (r \text{ normalised})$                                                                                     |
| FIX                   | 6        | $r : F; y, x : I; \uparrow r; \downarrow y, x: r = y \times 2^x, -1 \leq y <+1$                                                            |
| MAXF                  | 6        | $r, l : F; \uparrow r, l; \text{if } l \geq r \text{ then } \downarrow r, l; \text{ set Overflow else } \downarrow l, r \text{ end}$       |
| FLOAT                 | 7+n      | $x : F; r, l : I; \uparrow r, l; \downarrow x = l \times 2^r: -128 \leq r \leq +127$                                                       |
| +F, -F                | 7+a+n    | $r, l : F; \uparrow r, l; \downarrow l \pm r$                                                                                              |
| FLOATD                | 8+n      | $x : DF; r : I; l : D; \uparrow r, l; \downarrow x = l \times 2^r, \text{unrounded: } -128 \leq r \leq +127$                               |
| NEGDF                 | 9+n      | $r : DF; \uparrow r; \downarrow -r, \text{unrounded}$                                                                                      |
| +DF, -DF              | 12+a+n Δ | $r, l : DF; \uparrow r, l; \downarrow l \pm r, \text{unrounded}$                                                                           |
| xD                    | 14       | $r, l : I; x : D; \uparrow r, l; \downarrow x = l \times r$                                                                                |
| ×                     | 15       | $r, l : I; x : D; \uparrow r, l; \downarrow D0-D47 \text{ of } (x = l \times r), \text{rounded}$                                           |
| xF                    | 15+n     | $r, l, x : F; \uparrow r, l; \downarrow x = l \times r$                                                                                    |
| xD                    | 16+n     | $r, l : F; x : DF; \uparrow r, l; \downarrow x = l \times r, \text{unrounded}$                                                             |
| xF                    | 18+a+n   | $r, l : F; x, s : DF; \uparrow r, l; \uparrow s; \downarrow x = l \times r + s, \text{unrounded}$                                          |
| BITS                  | 27       | $r : W; x : I; \uparrow r; \downarrow x = \# \text{ of 1-bits in } r$                                                                      |
| ÷F                    | 35+n     | $r, l, x : F; \uparrow r, l; \downarrow x = l \div r, \text{unrounded}$                                                                    |
| ÷DF                   | 36+n     | $x, r : F; l : DF; \uparrow r, l; \downarrow x = l \div r, \text{unrounded}$                                                               |
| ÷                     | 36+a+n   | $x, r, l : I; \uparrow r, l; \downarrow x = l \div r, \text{rounded: } -1 \leq x <+1$                                                      |
| ÷D                    | 36+a+n   | $x, r : I; l : D; \uparrow r, l; \downarrow x = l \div r, \text{rounded: } -1 \leq x <+1$                                                  |
| ÷I                    | 36+a+n   | $d, n, q, r : I; \uparrow d; \uparrow n; \downarrow q, r: n = d \times q + r,  r  <  d , rd > 0$                                           |
| ÷R                    | 36+a+n   | $f, x, r : I; l : D; \uparrow r, l; \downarrow f, x: l = r \times x + 2^{-47}f, -1 \leq x <+1$                                             |
| TOB                   | ≈ 2+4b   | $r, l : W; b : I; \uparrow r, l; \downarrow b = r \text{ converted to binary using radixes } l$                                            |
| FRB                   | ≈ 8+3b   | $r : I; l, c : W; \uparrow r, l; \downarrow c = r \text{ converted from binary using radixes } l$                                          |

**KEY:**

*a* is shifting time in excess of 1μs needed to align the operands

*n* is shifting time in excess of 1μs needed to produce a normalized result

*b* is the number of 1 bits in the numerical operand

Δ denotes a “variable time” instruction, whose timing depends on which operand has the larger exponent

## APPENDIX 2: KDF9 INSTRUCTION SET ENCODING

Operand coding symbols:

|            |                                                            |
|------------|------------------------------------------------------------|
| a          | A main store word address bit                              |
| b          | A non-decoded “don’t care” bit—should be set to 0          |
| h          | A halfword is to be added to the return address if $h = 0$ |
| i          | A bit of a 16-bit constant                                 |
| kkkk, qqqq | Q Store register numbers (0..15)                           |
| sss        | A instruction syllable number in 0..5; 6 and 7 invalid     |
| u          | Device is set unready on completion of operation           |

The first column gives the identifier of the MC microcode sequence that carries out the instruction.

| Instruction | \       | Bits: | 0,1 | 2,3 | 4 | 5-7 | 8-11 | 12-15 | 16-23    |
|-------------|---------|-------|-----|-----|---|-----|------|-------|----------|
| S1          | JSr     |       | 10  | 00  | a | sss | 1101 | aaaa  | aaaaaaaa |
| S2          | JrCqZ   |       | 10  | 10  | a | sss | qqqq | aaaa  | aaaaaaaa |
| S2          | JrCqNZ  |       | 10  | 11  | a | sss | qqqq | aaaa  | aaaaaaaa |
| S3          | Jr      |       | 10  | 00  | a | sss | 1011 | aaaa  | aaaaaaaa |
| S4          | JrCqNZS |       | 01  | 11  | 1 | 111 | qqqq | bbbb  |          |
| S5          | Jr=     |       | 10  | 01  | a | sss | 0001 | aaaa  | aaaaaaaa |
| S5          | Jr≠     |       | 10  | 00  | a | sss | 0001 | aaaa  | aaaaaaaa |
| S6          | Jr>Z    |       | 10  | 01  | a | sss | 0100 | aaaa  | aaaaaaaa |
| S6          | Jr<Z    |       | 10  | 00  | a | sss | 0100 | aaaa  | aaaaaaaa |
| S6          | Jr<Z    |       | 10  | 01  | a | sss | 0010 | aaaa  | aaaaaaaa |
| S6          | Jr≥Z    |       | 10  | 00  | a | sss | 0010 | aaaa  | aaaaaaaa |
| S6          | Jr=Z    |       | 10  | 01  | a | sss | 0110 | aaaa  | aaaaaaaa |
| S6          | Jr≠Z    |       | 10  | 00  | a | sss | 0110 | aaaa  | aaaaaaaa |
| S7          | JrV     |       | 10  | 01  | a | sss | 1000 | aaaa  | aaaaaaaa |
| S7          | JrNV    |       | 10  | 00  | a | sss | 1000 | aaaa  | aaaaaaaa |
| S7          | JrEN    |       | 10  | 01  | a | sss | 1010 | aaaa  | aaaaaaaa |
| S7          | JrNEN   |       | 10  | 00  | a | sss | 1010 | aaaa  | aaaaaaaa |
| S7          | JrEJ    |       | 10  | 01  | a | sss | 1100 | aaaa  | aaaaaaaa |
| S7          | JrNEJ   |       | 10  | 00  | a | sss | 1100 | aaaa  | aaaaaaaa |
| S7          | JrTR    |       | 10  | 01  | a | sss | 1110 | aaaa  | aaaaaaaa |
| S7          | JrNTR   |       | 10  | 00  | a | sss | 1110 | aaaa  | aaaaaaaa |
| S8          | EXIT    |       | 10  | 00  | a | 0h0 | 1111 | aaaa  | aaaaaaaa |
| S9          | EXITD   |       | 10  | 01  | 0 | 010 | 1111 | 0000  | 00000000 |

|     |       |    |    |   |     |      |      |
|-----|-------|----|----|---|-----|------|------|
| S10 | LINK  | 01 | 11 | 1 | 011 | 0000 | bbbb |
| S11 | =LINK | 01 | 11 | 1 | 100 | 0000 | bbbb |

|     |         |    |    |   |     |      |      |
|-----|---------|----|----|---|-----|------|------|
| S12 | M+Iq    | 01 | 10 | 0 | 000 | qqqq | bbbb |
| S13 | M-Iq    | 01 | 10 | 0 | 001 | qqqq | bbbb |
| S14 | NCq     | 01 | 10 | 0 | 010 | qqqq | bbbb |
| S15 | DCq     | 01 | 10 | 0 | 011 | qqqq | bbbb |
| S16 | Iq=1    | 01 | 10 | 0 | 100 | qqqq | bbbb |
| S17 | Iq=-1   | 01 | 10 | 0 | 101 | qqqq | bbbb |
| S18 | Iq=2    | 01 | 10 | 0 | 110 | qqqq | bbbb |
| S19 | Iq=-2   | 01 | 10 | 0 | 111 | qqqq | bbbb |
| S20 | MkTOQq  | 01 | 10 | 1 | 001 | kkkk | qqqq |
| S20 | IkTOQq  | 01 | 10 | 1 | 010 | kkkk | qqqq |
| S20 | IMkTOQq | 01 | 10 | 1 | 011 | kkkk | qqqq |
| S20 | CkTOQq  | 01 | 10 | 1 | 100 | kkkk | qqqq |
| S20 | CMkTOQq | 01 | 10 | 1 | 101 | kkkk | qqqq |
| S20 | CIkTOQq | 01 | 10 | 1 | 110 | kkkk | qqqq |
| S20 | QkTOQq  | 01 | 10 | 1 | 111 | kkkk | qqqq |

|     |    |    |    |   |     |      |      |
|-----|----|----|----|---|-----|------|------|
| S21 | Qq | 01 | 11 | 1 | 001 | qqqq | 111b |
| S22 | Cq | 01 | 11 | 1 | 001 | qqqq | 100b |
| S23 | Iq | 01 | 11 | 1 | 001 | qqqq | 010b |
| S24 | Mq | 01 | 11 | 1 | 001 | qqqq | 001b |

|     |     |    |    |   |     |      |      |          |
|-----|-----|----|----|---|-----|------|------|----------|
| S25 | SET | 11 | bb | b | 1b0 | iiii | iiii | iiiiiiii |
|-----|-----|----|----|---|-----|------|------|----------|

|     |     |    |    |   |     |      |      |
|-----|-----|----|----|---|-----|------|------|
| S26 | =Qq | 01 | 11 | 1 | 000 | qqqq | 1110 |
|-----|-----|----|----|---|-----|------|------|

|     |      |    |    |   |     |      |      |
|-----|------|----|----|---|-----|------|------|
| S26 | =Cq  | 01 | 11 | 1 | 000 | qqqq | 1000 |
| S26 | =Iq  | 01 | 11 | 1 | 000 | qqqq | 0100 |
| S26 | =Mq  | 01 | 11 | 1 | 000 | qqqq | 0010 |
| S27 | =RCq | 01 | 11 | 1 | 000 | qqqq | 1001 |
| S27 | =RIq | 01 | 11 | 1 | 000 | qqqq | 0101 |
| S27 | =RMq | 01 | 11 | 1 | 000 | qqqq | 0011 |
| S28 | =+Qq | 01 | 11 | 1 | 010 | qqqq | 111b |
| S29 | =+Cq | 01 | 11 | 1 | 010 | qqqq | 100b |
| S30 | =+Iq | 01 | 11 | 1 | 010 | qqqq | 010b |
| S31 | =+Mq | 01 | 11 | 1 | 010 | qqqq | 001b |

|     |                  |    |    |   |     |      |      |
|-----|------------------|----|----|---|-----|------|------|
| S32 | SHA $\pm n$      | 01 | 11 | 0 | 001 | nnnn | nnn1 |
| S32 | SHAD $\pm n$     | 01 | 11 | 0 | 010 | nnnn | nnn1 |
| S32 | $\times + \pm n$ | 01 | 11 | 0 | 011 | nnnn | nnn1 |
| S32 | SHL $\pm n$      | 01 | 11 | 0 | 100 | nnnn | nnn1 |
| S32 | SHLD $\pm n$     | 01 | 11 | 0 | 110 | nnnn | nnn1 |
| S32 | SHC $\pm n$      | 01 | 11 | 0 | 111 | nnnn | nnn1 |
| S33 | SHACq            | 01 | 11 | 0 | 001 | qqqq | bbb0 |
| S33 | SHADCq           | 01 | 11 | 0 | 010 | qqqq | bbb0 |
| S33 | $\times + Cq$    | 01 | 11 | 0 | 011 | qqqq | bbb0 |
| S33 | SHLCq            | 01 | 11 | 0 | 100 | qqqq | bbb0 |
| S33 | SHLDCq           | 01 | 11 | 0 | 110 | qqqq | bbb0 |
| S33 | SHCCq            | 01 | 11 | 0 | 111 | qqqq | bbb0 |

|     |     |    |    |   |     |      |      |
|-----|-----|----|----|---|-----|------|------|
| S34 | =K0 | 01 | 11 | 1 | 101 | 1000 | 0000 |
| S34 | =K1 | 01 | 11 | 1 | 101 | 0100 | 0000 |
| S34 | =K2 | 01 | 11 | 1 | 101 | 0010 | 0000 |
| S34 | =K3 | 01 | 11 | 1 | 101 | 0001 | 0000 |
| S35 | K4  | 01 | 11 | 1 | 110 | 0000 | 1000 |
| S35 | K5  | 01 | 11 | 1 | 110 | 0000 | 0100 |
| S35 | K7  | 01 | 11 | 1 | 110 | 0000 | 0001 |

|     |       |    |    |   |     |      |      |          |
|-----|-------|----|----|---|-----|------|------|----------|
| S36 | EaMq  | 11 | aa | a | 000 | qqqq | aaaa | aaaaaaaa |
| S36 | EaMqQ | 11 | aa | a | 010 | qqqq | aaaa | aaaaaaaa |

|     |         |    |    |   |     |      |      |
|-----|---------|----|----|---|-----|------|------|
| S37 | MkMq    | 01 | 00 | 0 | 000 | qqqq | kkkk |
| S37 | MkMqQ   | 01 | 00 | 0 | 010 | qqqq | kkkk |
| S37 | MkMqH   | 01 | 00 | 0 | 100 | qqqq | kkkk |
| S37 | MkMqQH  | 01 | 00 | 0 | 110 | qqqq | kkkk |
| S37 | MkMqN   | 01 | 00 | 1 | 000 | qqqq | kkkk |
| S37 | MkMqQN  | 01 | 00 | 1 | 010 | qqqq | kkkk |
| S37 | MkMqHN  | 01 | 00 | 1 | 100 | qqqq | kkkk |
| S37 | MkMqQHN | 01 | 00 | 1 | 110 | qqqq | kkkk |

|     |        |    |    |   |     |      |      |          |
|-----|--------|----|----|---|-----|------|------|----------|
| S38 | =EaMq  | 11 | aa | a | 001 | qqqq | aaaa | aaaaaaaa |
| S38 | =EaMqQ | 11 | aa | a | 011 | qqqq | aaaa | aaaaaaaa |

|     |          |    |    |   |     |      |      |
|-----|----------|----|----|---|-----|------|------|
| S39 | =MkMq    | 01 | 00 | 0 | 001 | qqqq | kkkk |
| S39 | =MkMqQ   | 01 | 00 | 0 | 011 | qqqq | kkkk |
| S39 | =MkMqH   | 01 | 00 | 0 | 101 | qqqq | kkkk |
| S39 | =MkMqQH  | 01 | 00 | 0 | 111 | qqqq | kkkk |
| S39 | =MkMqN   | 01 | 00 | 1 | 001 | qqqq | kkkk |
| S39 | =MkMqQN  | 01 | 00 | 1 | 011 | qqqq | kkkk |
| S39 | =MkMqHN  | 01 | 00 | 1 | 101 | qqqq | kkkk |
| S39 | =MkMqQHN | 01 | 00 | 1 | 111 | qqqq | kkkk |

|     |     |        |    |    |   |     |      |      |
|-----|-----|--------|----|----|---|-----|------|------|
| S40 |     | CLOQq  | 01 | 01 | 0 | 100 | qqqq | 001u |
| S40 |     | TLOQq  | 01 | 01 | 0 | 100 | qqqq | 010u |
| S40 | PIA | MFRQq  | 01 | 01 | 0 | 100 | qqqq | 000u |
| S40 | PIB | MFREQq | 01 | 01 | 0 | 101 | qqqq | 000u |
| S40 | PIC | RCQq   | 01 | 01 | 0 | 100 | qqqq | 100u |
| S40 | PID | RCEQq  | 01 | 01 | 0 | 101 | qqqq | 100u |
| S40 | PIE | MBRQq  | 01 | 01 | 0 | 110 | qqqq | 000u |
| S40 | PIF | MBREQq | 01 | 01 | 0 | 111 | qqqq | 000u |
| S?? | PIG | PIGQq  | 01 | 01 | 0 | 110 | qqqq | 100u |
| S?? | PIH | PIHQq  | 01 | 01 | 0 | 111 | qqqq | 100u |

|     |     |         |    |    |   |     |      |      |
|-----|-----|---------|----|----|---|-----|------|------|
| S41 | POA | MWQq    | 01 | 01 | 1 | 000 | qqqq | 000u |
| S41 | POB | MWEQq   | 01 | 01 | 1 | 001 | qqqq | 000u |
| S41 | POC | MLWQq   | 01 | 01 | 1 | 000 | qqqq | 100u |
| S41 | POD | MLWEQq  | 01 | 01 | 1 | 001 | qqqq | 100u |
| S42 | POE | MGAPQq  | 01 | 01 | 1 | 000 | qqqq | 110u |
| S42 | POF | MWIPEQq | 01 | 01 | 1 | 000 | qqqq | 010u |
| S?? | POG | POGQq   | 01 | 01 | 1 | 010 | qqqq | 000u |
| S?? | POH | POHQq   | 01 | 01 | 1 | 011 | qqqq | 000u |
| S?? | POK | POKQq   | 01 | 01 | 1 | 011 | qqqq | 100u |
| S?? | POL | POLQq   | 01 | 01 | 1 | 010 | qqqq | 100u |

|     |     |               |    |    |   |     |      |      |
|-----|-----|---------------|----|----|---|-----|------|------|
| S43 |     | INTQq         | 01 | 01 | 1 | 10b | qqqq | 0b1u |
| S44 |     | CTQq          | 01 | 01 | 0 | 000 | qqqq | 0000 |
| S44 |     | MANUALQq      | 01 | 01 | 0 | 000 | qqqq | 0001 |
| S44 |     | BUSYQq        | 01 | 01 | 0 | 000 | qqqq | 001u |
| S44 |     | PARQq         | 01 | 01 | 0 | 001 | qqqq | 000u |
| S43 | PMA | MFSKQq        | 01 | 01 | 1 | 100 | qqqq | 000u |
| S44 | PMB | MBTQq         | 01 | 01 | 0 | 000 | qqqq | 100u |
| S44 | PMC | MLBQq         | 01 | 01 | 0 | 000 | qqqq | 010u |
| S43 | PMD | MRWDQq        | 01 | 01 | 1 | 110 | qqqq | 100u |
| S43 | PME | MBSKQq        | 01 | 01 | 1 | 110 | qqqq | 000u |
| S44 | PMF | METQq         | 01 | 01 | 0 | 010 | qqqq | 000u |
| S?? | PMG | fetch C store | 01 | 01 | 0 | 010 | qqqq | 010u |
| S?? | PMH | set lock-out  | 01 | 01 | 1 | 000 | qqqq | 001u |
| S?? | PMK | 7T FSK even   | 01 | 01 | 1 | 100 | qqqq | 010u |
| S?? | PML | 7T BSK even   | 01 | 01 | 1 | 110 | qqqq | 010u |

**Note:** older versions of this document gave incorrect codes for PMG and PMH. These wrong guesses are implemented in versions of **ee9** prior to V2.1, and in the versions of kal3 included with those distributions.

|     |     |    |    |   |     |      |      |          |
|-----|-----|----|----|---|-----|------|------|----------|
| S45 | OUT | 10 | 00 | b | bbb | 1001 | bbbb | bbbbbbbb |
|-----|-----|----|----|---|-----|------|------|----------|

|     |                    |    |        |
|-----|--------------------|----|--------|
| S46 | no-op              | 00 | 000000 |
| S46 | VR                 | 00 | 000001 |
| S46 | =TR                | 00 | 000010 |
| S46 | BITS               | 00 | 000011 |
| S46 | xF                 | 00 | 000100 |
| S46 | xD <sup>F</sup>    | 00 | 000101 |
| S46 | Unused ?           | 00 | 000110 |
| S46 | x+F                | 00 | 000111 |
| S46 | NEGD               | 00 | 001000 |
| S46 | OR                 | 00 | 001001 |
| S46 | PERM               | 00 | 001010 |
| S46 | TOB                | 00 | 001011 |
| S46 | ROUNDH             | 00 | 001100 |
| S46 | NEV                | 00 | 001101 |
| S46 | ROUND              | 00 | 001110 |
| S46 | DUMMY              | 00 | 001111 |
| S46 | ROUND <sup>F</sup> | 00 | 010000 |
| S46 | ROUNDHF            | 00 | 010001 |
| S46 | -DF                | 00 | 010010 |
| S46 | +DF                | 00 | 010011 |
| S46 | FLOAT              | 00 | 010100 |
| S46 | FLOATD             | 00 | 010101 |
| S46 | ABS                | 00 | 010110 |
| S46 | NEG                | 00 | 010111 |
| S46 | ABSF               | 00 | 011000 |
| S46 | NEGF               | 00 | 011001 |
| S46 | MAX                | 00 | 011010 |
| S46 | NOT                | 00 | 011011 |
| S46 | xD                 | 00 | 011100 |
| S46 | x                  | 00 | 011101 |
| S46 | -                  | 00 | 011110 |
| S46 | SIGN               | 00 | 011111 |

|     |         |    |        |
|-----|---------|----|--------|
| S46 | Illegal | 00 | 100000 |
| S46 | ZERO    | 00 | 100001 |
| S46 | DUP     | 00 | 100010 |
| S46 | DUPD    | 00 | 100011 |
| S46 | ÷I      | 00 | 100100 |
| S46 | FIX     | 00 | 100101 |
| S46 | Illegal | 00 | 100110 |
| S46 | STR     | 00 | 100111 |
| S46 | CONT    | 00 | 101000 |
| S46 | REVD    | 00 | 101001 |
| S46 | ERASE   | 00 | 101010 |
| S46 | -D      | 00 | 101011 |
| S46 | AND     | 00 | 101100 |
| S46 | Illegal | 00 | 101101 |
| S46 | +       | 00 | 101110 |
| S46 | +D      | 00 | 101111 |
| S46 | ÷       | 00 | 110000 |
| S46 | ÷D      | 00 | 110001 |
| S46 | ÷F      | 00 | 110010 |
| S46 | ÷DF     | 00 | 110011 |
| S46 | ÷R      | 00 | 110100 |
| S46 | REV     | 00 | 110101 |
| S46 | CAB     | 00 | 110110 |
| S46 | FRB     | 00 | 110111 |
| S46 | STAND   | 00 | 111000 |
| S46 | NEGDF   | 00 | 111001 |
| S46 | MAXF    | 00 | 111010 |
| S46 | Illegal | 00 | 111011 |
| S46 | +F      | 00 | 111100 |
| S46 | -F      | 00 | 111101 |
| S46 | Illegal | 00 | 111110 |
| S46 | SIGNF   | 00 | 111111 |

### APPENDIX 3: THE I/O INSTRUCTIONS

In the following tables, blank cells are to be taken as inheriting the *common* semantics. Where something is shown in a cell it overrides the *common* semantics.

#### INPUT INSTRUCTIONS

|               | PIA | PIB | PIC  | PID  | PIE  | PIF  | PIG  | PIH  |
|---------------|-----|-----|------|------|------|------|------|------|
| <i>common</i> | R   | RE  | CR   | CRE  | =PIA | =PIB | =PIC | =PID |
| CR            | BR  | BRE | CBR  | CBRE | AR   | ARE  | CAR  | CARE |
| DR            |     |     | =PIA | =PIB |      |      | =PIA | =PIB |
| FD            |     |     | FR   | FRE  | RN   | RNE  | FRN  | FRNE |
| MT EE         |     |     | =PIA | =PIB | <R   | <RE  | =PIE | =PIF |
| MT IBM        | OR  | VR  | =PIA | =PIB | O<R  | V<R  | =PIE | =PIF |
| SI            |     |     |      |      | XR   | XRE  |      |      |
| TR            |     |     |      |      |      |      |      |      |
| FW            |     |     |      |      |      |      |      |      |

#### OUTPUT INSTRUCTIONS

|               | POA | POB  | POC  | POD  | POE  | POF  | POG | POH | POK  | POL |
|---------------|-----|------|------|------|------|------|-----|-----|------|-----|
| <i>common</i> | W   | WE   | CW   | CWE  | LIV  | LIV  | UND | UND | UND  | UND |
| CP            | BW  | BWE  | CBW  | CBWE | =POC | =POA | AW  | AWE | CAWE | CAW |
| DR            |     |      | =POA | =POB | WZ   | =POE |     |     |      |     |
| FD            |     |      | FW   | FWE  | =POC | =POA | WN  | WNE | FWNE | FWE |
| GP            |     | =POA |      | =POC | NOP  | NOP  |     |     |      |     |
| LP            |     |      |      |      | ???  | ???  | ??? | ??? | ???  | ??? |
| MT EE         |     |      | WL   | WLE  | GAP  | WIPE |     |     |      |     |
| MT IBM        | OW  | VW   | OT   | VT   | GAP  | WIPE |     |     |      |     |
| SI            |     |      |      |      | CGAP | WGAP |     |     |      |     |
| TP            |     |      |      |      | CGAP | WGAP |     |     |      |     |
| FW            |     |      |      |      |      |      |     |     |      |     |

#### DEVICE CONTROL INSTRUCTIONS

|               | PMA | PMB  | PMC  | PMD | PME | PMF  | PMG | PMH | PMK | PML |
|---------------|-----|------|------|-----|-----|------|-----|-----|-----|-----|
| <i>common</i> | LIV | NOP  | NOP  | LIV | LIV | NOP  | RCS | SLO | UND | UND |
| CP            |     |      |      |     |     |      |     |     |     |     |
| CR            |     | TRC? |      |     |     |      |     |     |     |     |
| DR            |     |      |      |     |     |      |     |     |     |     |
| FD            | S   |      |      | P   | UND | EOA? |     |     |     |     |
| GP            |     | GPA? |      |     |     |      |     |     |     |     |
| LP            |     |      |      |     |     |      |     |     |     |     |
| MT EE         | >>  | BOT? | LBL? | <<< | <<  | ETW? |     |     |     |     |
| MT IBM        | O>> | BOT? | LBL? | <<< | O<< | ETW? |     |     | V>> | V<< |
| SI            |     | T8T? | T8T? |     |     |      |     |     |     |     |
| TP            |     | GPA? |      |     |     |      |     |     |     |     |
| TR            |     | T8T? |      |     |     |      |     |     |     |     |
| FW            |     |      |      |     |     |      |     |     |     |     |

**KEY:**

|               |                                                                                       |
|---------------|---------------------------------------------------------------------------------------|
| <i>common</i> | these effects are common to all devices, unless over-ridden by a specific table entry |
| CP            | card punch                                                                            |
| CR            | card reader                                                                           |
| DR            | drum store                                                                            |
| FD            | fixed disc store                                                                      |
| GP            | graph plotter                                                                         |
| LP            | line printer                                                                          |
| MT EE         | EE magnetic tape deck                                                                 |
| MT IBM        | IBM-compatible 7-track magnetic tape deck                                             |
| SI            | standard interface buffer                                                             |
| TP            | paper tape punch                                                                      |
| TR            | paper tape reader                                                                     |
| FW            | Flexowriter monitor typewriter                                                        |

|                        |                                                                                                 |
|------------------------|-------------------------------------------------------------------------------------------------|
| <b>A</b>               | alphanumeric (converted to/from all 12 card rows in each column)                                |
| <b>B</b>               | binary (direct to/from 6 card rows, twice in each column)                                       |
| <b>C</b>               | character (each character transferred is stored in the least significant 6 or 8 bits of a word) |
| <b>E</b>               | stopping on an End Message character                                                            |
| <b>F</b>               | using the fixed heads                                                                           |
| <b>N</b>               | next sector                                                                                     |
| <b>O</b>               | in odd parity                                                                                   |
| <b>P</b>               | clear disc head positions                                                                       |
| <b>R</b>               | read                                                                                            |
| <b>S</b>               | seek disc heads                                                                                 |
| <b>T</b>               | write tape mark                                                                                 |
| <b>V</b>               | in even parity                                                                                  |
| <b>W</b>               | write                                                                                           |
| <b>X</b>               | without parity                                                                                  |
| <b>Z</b>               | zeros                                                                                           |
| <b>&lt;</b>            | backward (for read operation)                                                                   |
| <b>&gt;&gt;</b>        | forward skip up to $Mq$ blocks                                                                  |
| <b>&lt;&lt;</b>        | backward skip up to $Mq$ blocks                                                                 |
| <b>&lt;&lt;&lt;</b>    | rewind the tape to before its first block                                                       |
| <b>=P<sub>xy</sub></b> | having the same effect as the P <sub>xy</sub> order for this device                             |
| <b>CGAP</b>            | punch a gap of length $Mq$ / 10 inches                                                          |
| <b>WGAP</b>            | punch a gap? ('word gap'; undocumented)                                                         |
| <b>GAP</b>             | erase an inter-block gap of $Mq$ words                                                          |
| <b>WIPE</b>            | erase a bigger inter-block gap than GAP                                                         |
| <b>BOT?</b>            | Test Register := tape at Beginning Of Tape window                                               |
| <b>EOA?</b>            | Test Register := disc transfer reached end of area                                              |
| <b>ETW?</b>            | Test Register := tape at End Tape Warning                                                       |
| <b>GPA?</b>            | Test Register := a graph plotter is attached to the buffer and not a tape punch                 |
| <b>LBL?</b>            | Test Register := tape at Last Block marker                                                      |
| <b>T8T?</b>            | Test Register := device in 8-channel mode                                                       |
| <b>TRC?</b>            | Test Register := recheck switch is set on the card reader                                       |
| <b>RCS</b>             | Director-only (LIV caused in program mode): 'read C store'                                      |
| <b>SLO</b>             | Director-only (LIV caused in program mode): 'set lock-outs'                                     |
| <b>LIV</b>             | Lock-In Violation (always)                                                                      |
| <b>NOP</b>             | no operation                                                                                    |
| <b>UND</b>             | 'undefined'                                                                                     |
| <b>???</b>             | undocumented                                                                                    |

#### APPENDIX 4: THE KDF9 CHARACTER SETS AND CODES

|              |             |     |         |       |     |     |     |     |
|--------------|-------------|-----|---------|-------|-----|-----|-----|-----|
| Line printer | SP          | NLC | CRLF/LS | FF/PC | NLC | NLC | %   | '   |
| Normal Case  | SP          | NLC | CRLF/LS | FF/PC | HT  | NLC | CS  | CN  |
| Shift Case   | SP          | NLC | CRLF/LS | FF/PC | HT  | NLC | CS  | CN  |
| Card rows    | <i>none</i> | Y68 | Y28     | 028   | 058 | Y58 | 048 | Y78 |
| Octal code   | 00          | 01  | 02      | 03    | 04  | 05  | 06  | 07  |

|              |     |     |     |     |     |     |     |    |
|--------------|-----|-----|-----|-----|-----|-----|-----|----|
| Line printer | :   | =   | (   | )   | £   | *   | ,   | /  |
| Normal Case  | NLC | /  |
| Shift Case   | NLC | :  |
| Card rows    | 48  | 38  | X58 | X28 | X38 | X48 | 038 | 01 |
| Octal code   | 10  | 11  | 12  | 13  | 14  | 15  | 16  | 17 |

|              |    |    |    |    |    |    |    |    |
|--------------|----|----|----|----|----|----|----|----|
| Line printer | 0  | 1  | 2  | 3  | 4  | 5  | 6  | 7  |
| Normal Case  | 0  | 1  | 2  | 3  | 4  | 5  | 6  | 7  |
| Shift Case   | ↑  | [  | ]  | <  | >  | =  | ×  | ÷  |
| Card rows    | 0  | 1  | 2  | 3  | 4  | 5  | 6  | 7  |
| Octal code   | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 |

|              |    |    |     |           |     |    |    |     |
|--------------|----|----|-----|-----------|-----|----|----|-----|
| Line printer | 8  | 9  | NLC | <u>10</u> | ;   | +  | -  | .   |
| Normal Case  | 8  | 9  | —   | <u>10</u> | ;   | +  | -  | .   |
| Shift Case   | (  | )  | —   | £         | ;   | ≠  | *  | ,   |
| Card rows    | 8  | 9  | 068 | X68       | Y48 | Y  | X  | Y38 |
| Octal code   | 30 | 31 | 32  | 33        | 34  | 35 | 36 | 37  |

|              |     |    |    |    |    |    |    |    |
|--------------|-----|----|----|----|----|----|----|----|
| Line printer | NLC | A  | B  | C  | D  | E  | F  | G  |
| Normal Case  | NLC | A  | B  | C  | D  | E  | F  | G  |
| Shift Case   | NLC | a  | b  | c  | d  | e  | f  | g  |
| Card rows    | 078 | Y1 | Y2 | Y3 | Y4 | Y5 | Y6 | Y7 |
| Octal code   | 40  | 41 | 42 | 43 | 44 | 45 | 46 | 47 |

|              |    |    |    |    |    |    |    |    |
|--------------|----|----|----|----|----|----|----|----|
| Line printer | H  | I  | J  | K  | L  | M  | N  | O  |
| Normal Case  | H  | I  | J  | K  | L  | M  | N  | O  |
| Shift Case   | h  | i  | j  | k  | l  | m  | n  | o  |
| Card rows    | Y8 | Y9 | X1 | X2 | X3 | X4 | X5 | X6 |
| Octal code   | 50 | 51 | 52 | 53 | 54 | 55 | 56 | 57 |

|              |    |    |    |    |    |    |    |    |
|--------------|----|----|----|----|----|----|----|----|
| Line printer | P  | Q  | R  | S  | T  | U  | V  | W  |
| Normal Case  | P  | Q  | R  | S  | T  | U  | V  | W  |
| Shift Case   | p  | q  | r  | s  | t  | u  | v  | w  |
| Card rows    | X7 | X8 | X9 | 02 | 03 | 04 | 05 | 06 |
| Octal code   | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 |

|              |    |    |    |     |     |     |     |    |
|--------------|----|----|----|-----|-----|-----|-----|----|
| Line printer | X  | Y  | Z  | NLC | NLC | NLC | NLC | Ø  |
| Normal Case  | X  | Y  | Z  | NLC | NLC | →   | NLC | Ø  |
| Shift Case   | x  | y  | z  | NLC | NLC | →   | NLC | Ø  |
| Card rows    | 07 | 08 | 09 | 58  | 68  | 78  | X78 | 28 |
| Octal code   | 70 | 71 | 72 | 73  | 74  | 75  | 76  | 77 |

#### KEY:

SP is (blank) Space; CRLF is Carriage Return Line Feed (i.e. New Line); PC is Page Change (i.e. Form Feed); HT is Horizontal Tab; CS is Case Shift; CN is Case Normal. NLC indicates a non-legible character (always suppressed by the line printer); and Ø represents the filler character, suppressed by all legible output devices in normal transfer modes.

The underline ‘\_’ does not advance the Flexowriter carriage and so is over-printed by the following character; it is used to represent Algol 60 ‘publication language’ basic symbols, e.g., ‘begin’, as ‘begin’.

The ‘10’ is the single-character exponent delimiter used in Algol 60’s real number syntax.

Punched cards have 80 columns of 12 rows. Rows 0-9 represent the decimal digits with a single punched hole. The other two rows are called X, above 0, and Y, at the top (on some non-KDF9 systems these are known as 11, just above 0, and 12 or 10 at the top). Most non-numeric characters are represented by combinations of two or three holes, one from rows Y, X, or 0, and the others in rows 1 through 8.

## APPENDIX 5: THE KDF9 GRAPH PLOTTER CODES

| Action                          | Decimal | Octal | Binary |
|---------------------------------|---------|-------|--------|
| None                            | 0       | #0    | 000000 |
| Step paper backwards            | 1       | #1    | 000001 |
| Step paper forwards             | 2       | #2    | 000010 |
| Step pen right                  | 4       | #4    | 000100 |
| Step pen right, paper backwards | 5       | #5    | 000101 |
| Step pen right, paper forwards  | 6       | #6    | 000110 |
| Step pen left                   | 8       | #10   | 001000 |
| Step pen left, paper backwards  | 9       | #11   | 001001 |
| Step pen left, paper forwards   | 10      | #12   | 001010 |
| Lower pen                       | 16      | #20   | 010000 |
| Raise pen                       | 32      | #40   | 100000 |

All other 6-bit character codes represent invalid plotter commands.

*The plotter command codes in [EEC69] Appendix 6, §5, p.302 are wrong, as there is an obvious error whereby 11 plotting codes are claimed but only 9 are given, the last of them being inconsistent with the others. The codes listed above have been confirmed by the evidence of other computers that used the same plotter.*

## REFERENCES AND FURTHER READING

[Davis60] *The English Electric KDF9 Computer System*; G.M. Davis;  
BCS Computer Bulletin, Vol. 4 No. 3; 1960.

[EEC62a] *KDF9: Basic Organisation of Controls; and Instruction Timings*; J.R. Lucking and J.P. O'Neil;  
Report K/GD.y.73, The English Electric Company Limited; 9 July 1962.

For the above document, and many others, see: [www.findlayw.plus.com/KDF9/Documents/](http://www.findlayw.plus.com/KDF9/Documents/)

[EEC62b] *Notes on KDF 9 Main Control*; J.A. Edwards;  
Report K/GD.u.293, The English Electric Company Limited; 7 September 1962.

[EEC62c] *KDF 9—Transformer Sequence Units*; J.R. Lucking;  
Report K/GD.y.81, The English Electric Company Limited; 15 November 1962.

For the above two documents, see: [sw.ccs.bcs.org/KDF9/Wichmann/techdoc.html](http://sw.ccs.bcs.org/KDF9/Wichmann/techdoc.html)

[ICL69] *KDF9 Programming Manual*  
Publication 1003 mm, 2nd Edition, International Computers Limited; October 1969.

[\*FindlayBM] *The KDF9 and Benchmarking*; W. Findlay; 2020.

[\*FindlayEE] *The English Electric KDF9*; W. Findlay; 2020.

[\*FindlayKB] *The KDF9: a Bibliography*; W. Findlay; 2020.

[\*FindlaySW] *The Software of the KDF9*; W. Findlay; 2020.

[\*FindlayUG] *Users' Guide for ee9: An English Electric KDF9 Emulator*; W. Findlay; 2020.

\* For the above five documents, see: [www.findlayw.plus.com/KDF9/](http://www.findlayw.plus.com/KDF9/)

[Haley62] *The KDF9 Computer System*; A.C.D. Haley;  
Proceedings of the Fall Joint Computer Conference, AFIPS Conference Proceedings, Vol.22; 1962.

[Longbottom]

[www.webarchive.org.uk/wayback/archive/20130303230703/www.roylongbottom.org.uk/whetstone.htm](http://www.webarchive.org.uk/wayback/archive/20130303230703/www.roylongbottom.org.uk/whetstone.htm)  
Roy Longbottom.

[UofI58] *On the Design of a Very High-Speed Computer*;

Report No. 80 2<sup>nd</sup> Edition, University of Illinois Graduate College Digital Computer Laboratory; April 1958.

For the above document, see:

[bitsavers.trailing-edge.com/pdf/univOfIllinoisUrbana/illiac/ILLIAC\\_II](http://bitsavers.trailing-edge.com/pdf/univOfIllinoisUrbana/illiac/ILLIAC_II)

[WR67] *Competition for memory access in the KDF9*; C.S. Wallace and B.G. Rowswell;  
Computer Journal, Vol.10, pp. 64-68; 1967.